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GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 



GCGAGCCCAA 


GCGGGCCTGC 


CCCTCAGTCG 


CGGATCCATC 


GACGTCAGAC 


GCGAGCCCAA 
GCGAGCCCAA 


GCGGGCCTGC 
GCGGGCCTGC 


CCCTCAGTCG 
CCCTCAGTCG 


CGGATCCATC 
CGGATCCATC 


GACGTCAGAC 
GACGTCAGAC 


GCGAGCCCAA 


GCGGGCCCGC 


CCCTCAGTCG 


CGGATCCATC 


GACGTCAGAC 




GCGAGCCCAA 
CTGTCACGTG 


GCGGGCCTGC 
AGTGCTTTTG 


CCCTCAGTCG 
CGACATTTTG 


CGGATCCATC 
C. . ATCCATC 


GACGTCAGAC 
GACGTCAGAC 



GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 



ATGAGCCCAA GCGGGCGCGC GAGTCAGTTG CGCAGCCATC GACGTCAGAC 

ATGAGCCCAA GCGGGCGCGC GAGTCAGTTG CGCAGCCATC GACGTCAGAC 

ATGAGCCCAA GCGGGCGCGC GAGTCAGTTG CGCAGCCATC GACGTCAGAC 

ATGAGCCCAA GCGGGCGCGC GAGTCAGTTG CGCAGCCATC GACGTCAGAC 

GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 

GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 

GTGAGCCCAA ACGGGTGCGC GAGTCAGTTG CGCAGCCATC GACGTCAGAC 

GCGAGCCAAA ACGGGAGTGC ACGTCACTTG CGCAGCCGAC AACGTCAGAC 

GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 

GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 

GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 

GCGAGCCCAA GCGGGCCTGC CCCTCAGTCG CGGATCCATC GACGTCAGAC 
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Fig. 1AM 



1901 

GCGGAAGGAG 
GCGGAAGGAG 
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CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 
CTCCGGTGGA 


CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 
CTTTGCCGAC 


AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 
AGGTACCAAA 


1950 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 
ACAAATGTTC 


GCGGAAGGAG 
GCGGAAGGAG 


CTCCGGTGGA 
CTCCGGTGGA 


CTTTGCCGAC 
CTTTGCCGAC 


AGGTACCAAA 
AGGTACCAAA 


ACAAATGTTC 
ACAAATGTTC 


GCGGAAGGAG 


CTCCGGTGGA 


CTTTGCCGAC 


AGGTACCAAA ACAAATGTTC 



GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAGTGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 



GCGGA. . .AG CTTCGATAAA CTACGCGGGC AGGTACCAAA ACAAATGTTC 
GCGGA...AG CTTCGATAAA CTACGCGGAC AGGTACCAAA ACAAATGTTC 
GCGGA... AG CTTCGATAAA CTACGCGGAC AGGTACCAAA ACAAATGTTC 
GCGGA... AG CTTCGATAAA CTACGCGGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGA ... AG CTTCGATCAA CTACGCAGAC AGGTACCAAA ACAAATGTTC 
GCGGA... AG CACCGGCGGA CTACGCGGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
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Fig. IAN 

1951 

4 2_2 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG . CAA GACATGCGAG 

42_8 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

42_15 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

42 5b TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

- lb GAATTC GCCCTT . . . . GGCTGCGTC 

42_13 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

42_3a TCGTCACGCG GGCATGCTTC AGATGCTGCT TCCCTG. CAA GACATGCGAG 

J 2 - 4 GAATTC GCCCTTTCTA CGGCTGCGTC 

42 5a TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA AACATGCGAG 
A Z-\° GAATTC GCCCTTTCTA CGGCTGCGTC 

A i-„ GAATTC GCCCTTTCTA CGGCTGCGTC 

42_11 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

42_6b TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

4 3_1 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA AACGTGCGAG 

A\l ^^ T ^ CGCG GGCATGCTTC AGACGCTGTT TCCCTG. CAA AACGTGCGAG 

4 3_12 TCGTCACGCG GGCATGCTCC AGATGCTGTT TCCCTG. CAA AACGTGCGAG 

43 20 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 
4 3 21 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 
43_23 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG CAA GACATGCGAG 
43_25 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

44_1 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA AACATGCGAG 

223 10 TCGTCACGCG GGCATGCT TC AGATGCTGTT TCCCTG. CAA AACATGCGAG 

223 2 . . . 

223_4 '/[ 

223 5 ' 

223 6 \ [ 

223J7 ]'/ 

A3_4 TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG .' TCG ACAATGCGAA 

A3_5 TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG. TCG ACAATGCGAA 

A3_7 TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG . TCG ACAATGCGAA 

A3_3 TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG. TCG ACAATGCGAA 

42_12 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

AAV1 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA GACATGCGAG 

AAV2 TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG. CAG ACAATGCGAG 

AAV3 TCGTCACGTG GGCATGAATC TGATGCTTTT TCCCTG. TAA AACATGCGAG 

AAV 8 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA AACGTGCGAG 

AAV9 TCGTCACGCG GGCATGCTTC AGATGCTGCT TCCCTG. CAA AACGTGCGAG 

AAV7 TCGTCACGCG GGCATGATTC AGATGCTGTT TCCCTG. CAA AACGTGCGAG 

4 4_2 TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG. CAA AACATGCGAG 
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Fig. 1AO 

2001 

42 2 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTP 

42 8 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
_ 5 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCGCGGGA CCAGAGaSg 

42 5b AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGaSg 

2 lb A.ACTGGACC A. .ATGAGAA CTTTCCCTTC A A CGaSSSc 

42 3 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
42 3a AGAATGAATC AGAATTTCAG CATTTGCTTC ACGCACGGGA CCAGAGACTG 

2 4 A.ACTGGACC A.. ATGAGAA CTTTCCCTTC A A CGMT^Sr 

42 5a AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGaSg 

2_10 A.ACTGGACC A. .ATGAGAA CTTTCCCTTC A A CGATTGCGTC 

42 3b A. ACTAGACC A. .ATGAGAA CTTTCCCTTC A 'a CGaSgcGIC 

42 11 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCGGAGaSg 

42 6b AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGaSg 
" ^ GAATC AGAATTTCAA CATTTGCTTC ACGCACGGGG TCAGAGACTG 

43 5 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGG TCAGAGACTG 
-,n AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGG TCAGAGACTG 

?Ay TGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 

43 21 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGaSg 
?A\ AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGaSg 

ll ? fAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGaSg 
_1 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
223 10 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
223_2 WWW. WW. 

223 4 .' 

223_5 

223_6 " " 

223_7 ............ 

A3 4 AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AAAAAGACTC 
A3 5 AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AAAAAGACTG 
A3_7 AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AA^SSg 
A3_3 AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AAAAAGACTC 

42_12 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA SgaSg 
AAV1 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CGAGAGaSg 
AGAATGAATC AGAATTCAAA TATCTGCTTC ACTCACGGAC AGA^AGaSc 
AAV3 AGAATGAATC AAATTTCCAA TGTCTGTTTT ACGCATGGTC AA^AGaSg 
AAV AGAATGAATC AGAATTTCAA CATTTGCTTC ACACACGGGG TCAGAGACTG 
Tvn AG f TGAATC AGAATTTCAA CATTTGCTTC ACACACGGGG TCAGAGACTG 
AAV7 AGAATGAATC AGAATTTCAA CATTTGCTTC ACACACGGGG TCAGAGaSg 
4 4_2 AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGaSg 
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4 2_2 


TTCAGAATGT 


TTCCCCGGCG 


42 8 


TTCAGAATGT 


TTCCCCGGCG 


42 15 


TTCAGAATGT 


TTCCCGGGCG 


42 5b 


TTCAGAATGT 


TTCCCCGGCG 


42 lb 


GACAAGATGG 


TGATCTGGTG 


42 13 


TTCAGAATGT 


TTCCCCGGCG 


42 3a 


TTCAGAATGT 


TTCCCCGGCG 


42 4 


GACAAGATGG 


TGATCTGGTG 


42 5a 


TTCAGAATGT 


TTCCCCGGCG 


42 10 


GACAAGATGG 


TGATCTGGTG 


42 3b 


GACAAGATGG 


TGATCTGGTG 


42 11 


TTCAGAATGT 


TTCCCCGGCG 


42 6b 


TTCAGAATGT 


TTCCCCGGCG 


43 1 


CTCAGAATGT 


TTCCCCGGTG 


43 5 


CTCAGAATGT 


TTCCCCGGTG 


43 12 


CTCAGAATGT 


TTCCCCGGTG 


43 20 


TTCAGAATGT 


TTCCCCGGCG 


43 21 


TTCAGAATGT 


TTCCCCGGCG 


43 23 


TTCAGAATGT 


TTCCCCGGCG 


43 25 


TTCAGAATGT 


TTCCCCGGCG 


44 1 


TTCAGAATGT 


TTCCCCGGCG 


44 5 
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TTCCCCGGCG 


223 10 






223 2 






223 4 
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223 6 






223 7 






A3 4 


TTTGGAATGC 


TTTCCCG . . . 


A3 5 


TTTGGAATGC 


TTTCCCG. . . 


A3 7 


TTTGGAATGC 


TTTCCCG. . . 


A3 3 


TTTGGAATGC 


TTTCCCG. . . 


42_12 


TTCAGAATGT 


TTCCCCGGCG 


AAV1 


TTCAGAGTGC 


TTCCCCGGCG 


AAV 2 


TTTAGAGTGC 


TTTCCCG. . . 


AAV 3 


TGGGGAATGC 


TTCCCTGGAA 


AAV 8 


CTCAGAGTGT 


TTCCCCGGCG 


AAV 9 


CTCAGAGTGT 


TTCCCCGGCG 


AAV 7 


TTTAGAGTGT 


TTCCCCGGCG 


44 2 


TTCAGAATGT 


TTCCCCGGCG 



Fig. 1AP 

2100 



TGTCAGAATC 


TCAACC. . . . 


- . GGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 
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GG . . AGGAGG 
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. . GGTCGTCA 
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TCAACC. . . . 


. . GGTCGTCA 


GG . . AGGAGG 


GCAAGA. . . . 


. . TGACGGCC 


TGTCAGAATC 


TCAACC .... 


. .GGTCGTCA 


GG. .AGGAGG 


GCAAGA .... 


. . TGACGGCC 


GG. .AGGAGG 


GCAAGA .... 


. . TGACGGCC 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. .GGTCGTCA 


CATCAGAATC 


TCAACC .... 


. .GGTCGTCA 


CATCAGAATC 


TCAACC .... 


. .GGTCGTCA 


CATCAGAATC 


TCAACC. . . . 


. . GGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


TGTCAGAATC 


TCAACC. . . . 


. . GGTCGTCA 
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TCAACC .... 
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. .GGTCGTCA 


TGTCAGAATC 
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. . ........ 

. . ........ 


. . GGTTGTCA 



TGTCAGAATC 


.......... 



TCAACCCGTT 




TCTGTCGTCA 


TGTCAGAATC 


TCAACCCGTT 


CCTGTCGTCA 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. .GGTCGTCA 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


TGTCAGAATC 


TCAACCCGTT 


TCTGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


TGTCAGAATC 


TCAACC. . . . 


. .GGTCGTCA 


TGTCAGAATC 


TCAACC 


. .GGTCGTCA 
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Fig. 1AQ 

2101 2150 

42_2 GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG . CG 

42_8 GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTAGGG . CG 

42 15 GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG. CG 

42~5b GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG. CG 

42~lb . AAGGTCGTG GAGTCCGCCA AG GCCA TTCATCATCT GCTGGGG. CG 

42~13 GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG . CG 

42~3a GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG . CG 

42 4 .AAGGTCGTG GAGTCCGCCA AG GCCA TTCATCATCT GCTGGGG . CG 

42 5a GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG. CG 

4 2~10 AA....GGTC GTGAAGTCCG CCAAG.GCCA TTCATCATCT GCTGGGG. CG 

42~3b AA GGTC GTGGAGTCCG CCAAG.GCCA TTCATCATCT GCTGGGG . CG 

4 2~~11 GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG . CG 

4 2 _ 6b GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG. CG 



43 1 


GAAAAAAAAC 


GTATCAGAAA 


CTGTGTGCCA 


TTCATCATCT 


GCTGGGG. 


CG 


43 5 


GAAAAAAAAC 


GTATCAGAAA 


CTGTGTGCCA 


TTCATCATCT 


GCTGGGG. 


CG 


43 12 


GAAAAAAAAC 


GTATCAGAAA 


CTGTGTGCCA 


TTCATCATCT 


GCTGGGG . 


CG 


43 20 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG. 


CG 


43 21 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG. 


CG 


43 23 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG. 


CG 


43 25 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG. 


CG 


44 1 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG. 


CG 


44 5 
!23 10 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG. 


CG 
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A3 4 


GAAAAACG . . 


. TAT C AG AAA 


CTTTGTTACA 


TT CAT CAT AT 


CATGGGA, 


.AA 


A3 5 


GAAAAACG . . 


. TATCAGAAA 


CTTTGTTACA 


TTCATCATAT 


CATGGGA 


.AA 


A3 7 


GAAAAACG . . 


. TATCAGAAA 


CTTTGTTACA 


TTCATCATAT 


CATGGGA 


.AA 


A3 3 


GAAAAACG . . 


. TATCAGAAA 


CTTTGTTACA 


TTCATCATAT 


CATGGGA 


.AA 


42 12 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG 


.CG 


AAV1 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG 


.CG 


AAV 2 


AAAAGGCG. . 


. TATCAGAAA 


CTGTGCTACA 


TTCATCATAT 


CATGGGA 


. AA 


AAV 3 


AAAAGAAGAC 


T TATCAGAAA 


CTGTGTCCAA 


TTCATCATAT 


CCTGGGA 


.AG 


AAV8 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG 


.CG 


AAV 9 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG 


.CG 


AAV 7 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGCGCGA 


TTCATCATCT 


GCTGGGG 


.CG 


44 2 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGGGCG 
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Fig. 1AR 

2151 

4 2__2 GGCTCCCGAG ATTGCTTGCT 

4 2_8 GGCTCCCGAG ATTGCTTGCT 

42_15 GGCTCCCGAG ATTGCTTGCT 

42_5b GGCTCCCGAG ATTGCTTGCT 

4 2_lb GGCTCCCGAG ATTGCTTGCT 

42_13 GGCTCCCGAG ATTGCTTGCT 

4 2_3a GGCTCCCGAG ATTGCTTGCT 

42_J GGCTCCCGAG ATTGCTTGCT 

4 2_5a GGCTCCCGAG ATTGCTTGCT 

42_10 GGCTCCCGAG ATTGCTTGCT 

4 2_3b GGCTCCCGAG ATTGCTTGCT 

4 2__11 GGCTCCCGAG ATTGCTTGCT 

42__6b GGCTCCCGAG ATTGCTTGCT 

4 3_1 GGCACCCGAG ATTGCTTGCT 

4 3_5 GGCACCCGAG ATTGCTTGCT 

4 3__12 GGCACCCGAG ATTGCTTGCT 

4 3__20 GGCTCCCGAG ATTGCTTGCT 

4 3_21 GGCTCCCGAG ATTGCTTGCT 

4 3_23 GGCTCCCGAG ATTGCTTGCT 

4 3__25 GGCTCCCGAG ATTGCTTGCT 

4 4__1 GGCACCCGAG ATTGCTTGCT 

44 5 GGCACCCGAG ATTGCTTGCT 
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A3 4 


AGAACCAGAC 


. . . GCCTGCA 


CTGCCTGCGA 


CCTGGTAAAT 


GTGGACTTGG 


A3 5 


AGTACCAGAC 


. . . GCCTGCA 


CTGCCTGCGA 


CCTGGTAAAT 


GTGGACTTGG 


A3 7 


AGTACCAGAC 


. . .GCCTGCA 


CTGCCTGCGA 


CCTGGTAAAT 


GTGGACTTGG 


A3 3 


AGTACCAGAC 


. . . GCCTGCA 


CTGCCTGCGA 


CCTGGTAAAT 


GTGGACTTGG 


42 12 


GGCTCCCGAG 


ATTGCTTGCT 


CGGCCTGCGA 


TCTGGTCAAC 


GTGGACCTGG 


AAV1 


GGCTCCCGAG 


ATTGCTTGCT 


CGGCCTGCGA 


TCTGGTCAAC 


GTGGACCTGG 


AAV2 


GGTGCCAGAC 


. . . GCTTGCA 


CTGCCTGCGA 


TCTGGTCAAT 


GTGGATTTGG 


AAV3 


GGCACCCGAG 


ATTGCCTGTT 


CGGCCTGCGA 


TTTGGCCAAT 


GTGGACTTGG 


AAV 8 


GGCTCCCGAG 


ATTGCTTGCT 


CGGCCTGCGA 


TCTGGTCAAC 


GTGGACCTGG 


AAV 9 


GGCTCCCGAG 


ATTGCTTGCT 


CGGCCTGCGA 


TCTGGTCAAC 


GTGGACCTGG 


AAV 7 


GGCGCCCGAG 


ATTGCTTGCT 


CGGCCTGCGA 


CCTGGTCAAC 


GTGGACCTGG 


44 2 


GGCACCCGAG 


ATTGCTTGCT 


CGGCCTGCGA 


TCTGGTCAAC 


GTGGACCTAG 
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GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
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GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
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GTGGACCTGG 
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GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTAG 
GTGGACCTAG 
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Fig. IAS 

2201 2250 

Rep 78 stop vpl start 

ATGACCGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 



ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 
ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 
ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 
ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGCAT CTTTGAACAA TAAATGATTT AAATCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCTGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ACGACTGCGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 
ATGACTGTGT TTCTGAGCAA TAAA TGACTT AAACCAGGT A TG GCTGCCGA 

Rep78 stop vpl start 
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Fig. 1AT 

2251 2300 

Rep 6 8 stop 

TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTCATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 



CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATCCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATAAGAC 
CGGTTATCTT CCAGATTGGC TCGAGGACAA CCTTTCTGAA GGCATTCGTG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 
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Fig. 1AU 



2301 2350 
AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 
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AACCCAAAGC 
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AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 

AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AGCCCAAAGC CAACCAGCAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CACCAAAGCC CGCAGAGCGG 

AGTGGTGGGC TCTGAAACCT GGAGTCCCTC AACCCAAAGC GAACCAACAA 

AGTGGTGGGC GCTGAAACCT GGAGCCCCGA AGCCCAAAGC CAACCAGCAA 

AGTGGTGGGC GCTGAAACCT GGAGCCCCGA AGCCCAAAGC CAACCAGCAA 

AGTGGTGGGA CCTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 

AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 
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Fig. 1AV 
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ACGGCCGGGG TCTGGTGCTT 
ACGGCCGGGG TCTGGTGCTT 



2400 

CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 
CCTGGCTACA AGTACCTCGG 



223 2 ' ' 

223 4 [] 

223 5 [ [[ 

223 6 \ m 

223 _J * . * " 

A3_4 CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 

A3_5 CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 

A3J7 CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 

A3_3 CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 

42JL2 AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 

AAV1 AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 

AAV2 CATAAGGACG ACAGCAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 

AAV 3 CACCAGGACA ACCGTCGGGG TCTTGTGCTT CCGGGTTACA AATACCTCGG 

AAV8 AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 

AAV 9 AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 

AAV 7 AAGCAGGACA ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 

4 4_2 AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 



48/105 



WO 03/042397 



PCT/US02/33629 



Fig. 1AW 

2401 2450 

4 2_2 ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

42~8 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 2_15 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 2~5b ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

4 2_lb ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

42_13 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 2__3a ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 2_4 ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

4 2_5a ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

42~~10 ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

42~3b ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

4 2_11 ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGCG GCGGACGCAG 

4 2 6b ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

43 1 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 3~S ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

43_12 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 3__20 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 3~21 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 3~23 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 3~25 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

44_1 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 4_5 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

223^10 

223 2 

223 4 

223 5 

223 6 

223 7 

A3_J ACCCTTCAAC GGACTCGACA AAGGAGAGCC GGTCAACGAG GCAGACGCCG 

A3~S ACCCTTCAAC GGACTCGACA AAGGAGAGCC GGTCAACGAG GCAGACGCCG 

A3~7 ACCCTTCAAC GGACTCGACA AAGGAGAGCC GGTCAACGAG GCAGACGCCG 

A3~3 ACCCTTCAAC GGACTCGACA AAGGAGAGCC GGTCAACGAG GCAGACGCCG 

42 12 ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

AAV1 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

AAV2 ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 

AAV3 ACCCGGTAAC GGACTCGACA AAGGAGAGCC GGTCAACGAG GCGGACGCGG 

AAV8 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

AAV 9 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

AAV7 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 

4 4 2 ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
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Fig. 1AX 

2451 

4 2_2 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

4 2_8 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42__15 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_5b CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_lb CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_13 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_3a CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_J CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_5a CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_10 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

42_3b CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

4 2_11 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

4 2_6b CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

4 3_1 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

4 3_5 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

4 3_12 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

43_20 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

4 3_21 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

43_23 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

43_25 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

44_1 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

4 4 5 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 



223_10 CAA GGCCTACGAC 

223_2 CAA GGCCTACGAC 

223_4 CAA GGCCTACGAC 

223_5 CAA GGCCTACGAC 

223_6 CAA GGCCTACGAC 

223 7 CAA GGCCTACGAC 



A3_4 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

A3_5 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

A3J7 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

A3_3 CGGCCCTCGA GCACG.ACAA AGCCTACGAC 

4 2_12 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

AAV1 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

AAV2 CGGCCCTCGA GCACGTACAA AGCCTACGAC 

AAV 3 CAGCCCTCGA ACACG.ACAA AGCTTACGAC 

AAV8 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

AAV 9 CGGCCCTCGA GCACG.GCAA GGCCTACGAC 

AAV 7 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 

44 2 CGGCCCTCGA GCACG.ACAA GGCCTACGAC 



2500 

AAGCAGCTCG AGCAGGGGGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
AAGCAGCTCG AGCAGGGGGA 
AAGCAGCTCG AGCAGGGGGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
AAGCAGCTCG AGCAGGGGGA 
AAGCAGCTCG AGCAGGGGGA 
AAGCAGCTCG AGCAGGGGGA 
AAGCAGCTCG AGCAGGGGGA 
CAGCAGCTCA AAGCGGGTGA 
AAGCAGCTCG AGCAGGGGGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CACCAGCTCA AGCAAGGGGA 
CACCAGCTCA AGCAAGGGGA 
CACCAGCTCA AGCAAGGGGA 
CACCAGCTCA AGCAAGGGGA 
AAGCAGCTCG AGCAGGGGGA 
CAGCAGCTCA AAGCGGGTGA 
CGGCAGCTCG ACAGCGGAGA 
CAGCAGCTCA AGGCCGGTGA 
CAGCAGCTGC AGGCGGGTGA 
CAGCAGCTGC AGGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
CAGCAGCTCA AAGCGGGTGA 
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CAACCCGTAC 
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CAATCCGTAC 


CTGCGGTATA 
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CAATCCGTAC 


CTGCGGTATA 
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CAACCCGTAC 


CTCAAGTACA 
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CAACCCGTAC 


CTCAAGTACA 
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CAATCCGTAC 


CTGCGGTATA 
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CAATCCGTAC 


CTGCGGTATA 
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CAACCCGTAC 


CTCAAGTACA 
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CAACCCGTAC 


CTCAAGTACA 
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CAACCCGTAC 


CTCAAGTACA 


42 3b 


CAACCCGTAC 


CTCAAGTACA 


42 11 


CAATCCGTAC 


CTGCGGTATA 


42 6b 


CAACCCGTAC 


CTCAAGTACA 


43 1 


CAATCCGTAC 


CTGCGGTATA 


43 5 


CAATCCGTAC 


CTGCGGTATA 


43 12 


CAATCCGTAC 


CTGCGGTATA 


43 20 


CAATCCGTAC 


CTGCGGTATA 


43 21 


CAATCCGTAC 


CTGCGGTATA 


43 23 


CAATCCGTAC 


CTGCGGTATA 


43 25 


CAATCCGTAC 


CTGCGGTATA 


44 1 


CAATCCGTAC 


CTGCGGTATA 


44 5 


CAATCCGTAC 


CTGCGGTATA 


223 10 


CAATCCGTAC 


CTGCGGTATA 


223 2 


CAATCCGTAC 


CTGCGGTATA 


223 4 


CAATCCGTAC 


CTGCGGTATA 


223 5 


CAATCCGTAC 


CTGCGGTATA 


223 6 


CAATCCGTAC 


CTGCGGTATA 


223 7 


CAATCCGTAC 


CTGCGGTATA 


A3 4 


CAACCCGTAC 


CTCAAATACA 


A3 5 


CAACCCGTAC 


CTCAAATACA 


A3 7 


CAACCCGTAC 


CTCAAATACA 


A3 3 


CAACCCGTAC 


CTCAAATACA 


42_12 


CAACCCGTAC 


CTCAAGTACA 


AAV1 


CAATCCGTAC 


CTGCGGTATA 


AAV2 


CAACCCGTAC 


CTCAAGTACA 


AAV 3 


CAACCCGTAC 


CTCAAGTACA 


AAV 8 


CAATCCGTAC 


CTGCGGTATA 


AAV 9 


CAATCCGTAC 


CTGCGGTATA 


AAV 7 


CAATCCGTAC 


CTGCGGTATA 


44 2 


CAATCCGTAC 


CTGCGGTATA 



Fig. 1AY 

2550 

ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGTGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCGGAGTTT CAGGAGCGCC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
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42 2 


TTCAAGAAGA 


42 8 


TGCAAGAAGA 


42 15 


TGCAAGAAGA 


42 5b 


TTCAAGAAGA 


42 lb 


TTCAAGAAGA 


42 13 


TTCAAGAAGA 


42 3a 


TTCAAGAAGA 


42 4 


TTCAAGAAGA 


42 5a 


TTCAAGAAGA 


42 10 


TTCAAGAAGA 


42 3b 


TTCAAGAAGA 


42 11 


TTCAAGAAGA 


42 6b 


TTCAAGAAGA 


43 1, 


TGCAAGAAGA 


43 5 


TGCAAGAAGA 


43 12 


TGCAAGAAGA 


43 20 


TGCAAGAAGA 


43 21 


TGCAAGAAGA 


43 23 


TGCAAGAAGA 


43 25 


TGCAAGAAGA 


44 1 


TGCAAGAAGA 


44 5 


TGCAAGAAGA 


223 10 


TTCAAGAAGA 


223 2 


TTCAAGAAGA 


223 4 


TTCAAGAAGA 


223 5 


TTCAAGAAGA 


223 6 


TTCAAGAAGA 


223 7 


TTCAAGAAGA 


A3 4 


TTCAAGAAGA 


A3 5 


TTCAAGAAGA 


A3 7 


TTCAAGAAGA 


A3 3 


TTCAAGAAGA 


42_12 


TTCAAGAAGA 


AAV1 


TGCAAGAAGA 


AAV 2 


TTAAAGAAGA 


AAV 3 


TTCAAGAAGA 


AAV 8 


TGCAAGAAGA 


AAV 9 


TGCAAGAAGA 


AAV 7 


TGCAAGAAGA 


44 2 


TGCAAGAAGA 



Fig. 1AZ 



TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCCTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTC GGGGGCAACC 
TACGTCTTTC GGGGGCAACC 
TACGTCTTTC GGGGGCAACC 
TACGTCTTTC GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 
TACGTCATTT GGGGGCAACC 
TACGTCTTTT GGGGGCAACC 



2600 

TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCGG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGACGAGC AGTCTTCCAG 
TTGGCAGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
TCGGGCGAGC AGTCTTCCAG 
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42 2 


GCCAAGAAGC 


GGGTTCTCGA 


42 8 


GCCAAGAAGC 


GGGTTCTCGA 


42 15 


GCCAAGAAGC 


GGGTTCTCGA 


42 5b 


GCCAAGAAGC 


GGGTTCTCGA 


42 lb 


GCCAAGAAGC 


GGGTTCTCGA 


42 13 


GCCAAGAAGC 


GGGTTCTCGA 


42 3a 


GCCAAGAAGC 


GGGTTCTCGA 


42 4 


GCCAAGAAGC 


GGGTTCTCGA 


42 5a 


GCCAAGAAGC 


GGGTTCTCGA 


42 10 


GCCAAGAAGC 


GGGTTCTCGA 


42 3b 


GCCAAGAAGC 


GGGTTCTCGA 


42_11 


GCCAAGAAGC 


GGGTTCTCGA 


42~*6b 


GCCAAGAAGC 


GGGTTCTCGA 


43 1 


GCCAAGAAGC 


GGGTTCTCGA 


4 3_5 


GCCAAGAAGC 


GGGTTCTCGA 


4 3 12 


GCCAAGAAGC 


GGGTTCTCGA 


43 20 


GCCAAGAAGC 


GGGTTCTCGA 


43 21 


GCCAAGAAGC 


GGGTTCTCGA 


43 23 


GCCAAGAAGC 


GGGTTCTCGA 


43 25 


GCCAAGAAGC 


GGGTTCTCGA 


44 1 


GCCAAGAAGC 


GGGTTCTCGA 


44 5 


GCCAAGAAGC 


GGGTTCTCGA 


223 10 


GCCAAAAAGC 


GGGTTCTCGA 


223 2 


GCCAAAAAGC 


GGGTTCTCGA 


223 4 


GCCAAAAAGC 


GGGTTCTCGA 


223 5 


GCCAAAAAGC 


GGGTTCTCGA 


223 6 


GCCAAAAAGC 


GGGTTCTCGA 


223 7 


GCCAAAAAGC 


GGGTTCTCGA 


A3 4 


GCCAAAAAGA 


GGGTACTCGA 


A3 5 


GCCAAAAAGA 


GGGTACTCGA 


A3 7 


GCCAAAAAGA 


GGGTACTCGA 


A3 3 


GCCAAAAAGA 


GGGTACTCGA 


42 12 


GCCAAGAAGC 


GGGTTCTCGA 


AAV1 


GCCAAGAAGC 


GGGTTCTCGA 


AAV2 


GCGAAAAAGA 


GGGTTCTTGA 


AAV 3 


GCCAAAAAGA 


GGATCCTTGA 


AAV 8 


GCCAAGAAGC 


GGGTTCTCGA 


AAV 9 


GCCAAGAAGC 


GGGTTCTCGA 


AAV 7 


GCCAAGAAGC 


GGGTTCTCGA 


44 2 


GCCAAGAAGC 


GGGTTCTCGA 



Fig. 1AAA 
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ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTGGGC CTGGTTGAGG AACCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCAGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 



53/105 



WO 03/042397 



PCT/US02/33629 



2651 





vp2 start 




42 2 


GACGGCTCCT 


GGAAAGAAGA 


42 8 


GACGGCTCCT 


GGAAAGAAGA 


42 15 


GACGGCTCCT 


GGAAAGAAGA 


42 5b 


GACGGCTCCT 


GGAAAGAAGA 


42 lb 


GACGGCTCCT 


GGAAAGAAGA 


42 13 


GACGGCTCCT 


GGAAAGAAGA 


42 3a 


GACGGCTCCT 


GGAAAGAAGA 


42 4 


GACGGCTCCT 


GGAAAGAAGA 


42 5a 


GACGGCTCCT 


GGAAAGAAGA 


42 10 


GACGGCTCCT 


GGAAAGAAGA 


42 3b 


GACGGCTCCT 


GGAAAGAAGA 


42 11 


GACGGCTCCT 


GGAAAGAAGA 


42 6b 


GACGGCTCCT 


GGAAAGAAGA 


43 1 


GACGGCTCCT 


GGAAAGAAGA 


43 5 


GACGGCTCCT 


GGAAAGAAGA 


43 12 


GACGGCTCCT 


GGAAAGAAGA 


43 20 


GACGGCTCCT 


GGAAAGAAGA 


43 21 


GACGGCTCCT 


GGAAAGAAGA 


43 23 


GACGGCTCCT 


GGAAAGAAGA 


43 25 


GACGGCTCCT 


GGAAAGAAGA 


44 1 


GACGGCTCCT 


GGAAAGAAGA 


44 5 


GACGGCTCCT 


GGAAAGAAGA 


223 10 


GACGGCACCT 


GGAAAGAAGC 


223 2 


GACGGCACCT 


GGAAAGAAGC 


223 4 


GACGGCACCT 


GGAAAGAAGC 


223 5 


GACGGCACCT 


GGAAAGAAGC 


223 6 


GACGGCACCT 


GGAAAGAAGC 


223 7 


GACGGCACCT 


GGAAAGAAGC 


A3 4 


GACGGCTCCT 


GGAAAAAAGA 


A3 5 


GACGGCTCCT 


GGAAAAAAGA 


A3 7 


GACGGCTCCT 


GGAAAAAAGA 


A3 3 


GACGGCTCCT 


GGAAAAAAGA 


42^12 


GACGGCTCCT 


GGAAAGAAGA 


AAV1 


GACGGCTCCT 


GGAAAGAAAC 


AAV2 


GACGGCTCCG 


GGAAAAAAGA 


AAV 3 


AACGGCTCCT 


GGAAAGAAGG 


AAV 8 


GACGGCTCCT 


GGAAAGAAGA 


AAV 9 


GACGGCTCCT 


GGAAAGAAGA 


AAV 7 


GACGGCTCCT 


GGAAAGAAGA 


44 2 


GACGGCTCCT 


GGAAAGAAGA 




vp2 start 





Fig. 1AAB 
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GACCCATAGA . . . ATCCCCC 

GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA . . . ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA . . .ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA . . .ATCCCCC 

GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCT CAGCGTTCCC 
GACCGGTAGA GCCATCACCT CAGCGTTCCC 
GACCGGTAGA GCCATCACCT .CAGCGTTCCC 
GACTGGTAGA GCAGTCGCCA CAAGAG . . . C 
GACCGGTAGA GCAGTCGCCA CAAGAG . . . C 
GACCGGTAGA GCAGTCGCCA CAAGAG . . . C 
GACCGGTAGA GCAGTCGCCA CAAGAG . . . C 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCTATAGA GCAGTCTCCT GCAGAA. . .C 
GACCTATAGA GCAGTCTCCT GCAGAA... C 
GACCTATAGA GCAGTCTCCT GCAGAA... C 
GACCTATAGA GCAGTCTCCT GCAGAA... C 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GTCCGGTAGA GCAGTCGCCA CAAGAG . . . C 
GGCCGGTAGA GCACTCTCCT GTGGAG...C 
GGGCTGTAGA TCAGTCTCCT CAGGAA. . . C 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCGTCACCT CAGCGTTCCC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
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42 2 


. . GACTCCTC 


42 8 


CAGACTCCTC 


42 15 


CAGACTCCTC 


42 5b 


CAGACTCCTC 


42 lb 


. .GACTCCTC 


42 13 


. . GACTCCTC 


42 3a 


. . GACTCCTC 


42 4 


. .GACTCCTC 


42 5a 


. . GACTCCTC 


42 10 


. . GACTCCTC 


42 3b 


. .GACTCCTC 


42 11 


. . GACTCCTC 


42 6b 


CAGACTCCTC 


43 1 


CCGACTCCTC 


43 5 


CCGACTCCTC 


43 12 


CCGACTCCTC 


43 20 


CAGACTCCTC 


43 21 


CAGACTCCTC 


43 23 


CAGACTCCTC 


43 25 


CAGACTCCTC 


44 1 


CAGACTCCTC 


44 5 


CAGACTCCTC 


223 10 


. . GACTCCAC 


223 2 


. . GACTCCAC 


223 4 


. . GACTCCAC 


223 5 


. . GACTCCAC 


223 6 


. . GACTCCAC 


223 7 


. . GACTCCAC 


A3 4 


CGGACTCTTC 


A3 5 


CGGACTCTTC 


A3 7 


CGGACTCTTC 


A3 3 


CGGACTCTTC 


42 12 


CAGACTCCTC 


AAV1 


CAGACTCCTC 


AAV2 


CAGACTCCTC 


AAV3 


CGGACTCATC 


AAV8 


CAGACTCCTC 


AAV9 


CAGACTCCTC 


AAV7 


CCGACTCCTC 


4 4 2 


CAGACTCCTC 



Fig. 1AAC 



CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGACAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAGGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGACAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCGAATCAG 
CTCGGGCATC GGCAAATCAG 
CTCGGGCATC GGCAAATCAG 
CTCGGGCATC GGCAAATCAG 
TACGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGAACC GGAAAGGCGG 
ATCTGGTGTT GGCAAATCGG 
TACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 



2750 

GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCACCAGCC CGCGAGAAAG 
GCCACCAGCC CGCGAGAAAG 
GCCACCAGCC CGCGAGAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC TGCAAGAAAA 
GCAAACAGCC TGCCAGAAAA 
GCCAACAGCC CGCCAGAAAA 
GCCAACAGCC CGCCAGAAAA 
GCCAGCAGCC CGCGAGAAAG 
GCCAGCAGCC CGCGAAAAAG 
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42 2 


AAGCTCAACT 


42 8 


AGACTCAACT 


42 15 


AGACTCAACT 


42 5b 


AGACTCAACT 


42 lb 


AGACTCAACT 


42 13 


AAGCTCAACT 


42 3a 


AAGCTCAACT 


42 4 


AAGCTCAACT 


42 5a 


AAGCTCAACT 


42 10 


AAGCTCAACT 


42 3b 


AAGCTCAACT 


42 11 


AAGCTCAACT 


42 6b 


AGACTCAACT 


43 1 


AGACTGAACT 


43 5 


AGACTGAACT 


43 12 


AGACTGAACT 


43 20 


AGACTCAATT 


43 21 


AGACTCAATT 


43 23 


AGACTCAATT 


43 25 


AGACTCAATT 


44 1 


AGACTCAACT 


44 5 


AGACTCAACT 


223 10 


AGACTCAACT 


223 2 


AGACTCAACT 


223 4 


AGACTCAACT 


223 5 


AGACTCAACT 


223 6 


AGACTCAACT 


223 7 


AGACTCAACT 


A3 4 


AGACTCAATT 


A3 5 


AGACTCAATT 


A3 7 


AGACTCAATT 


A3 3 


AGACTCAATT 


42 12 


AGACTCAACT 


AAV1 


AGACTCAATT 


AAV2 


AGATTGAATT 


AAV3 


AGACTAAATT 


AAV 8 


AGACTCAATT 


AAV 9 


AGACTCAATT 


AAV 7 


AGACTCAATT 


44 2 


AGACTCAACT 



Fig. 1AAD 



TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCG 
TTGGGCAGAC TGGCGACTCG 
TTGGGCAGAC TGGCGACTCG 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGGCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGAGACGCA 
TCGGTCAGAC TGGAGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TCGGTCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 



2800 

GAGTCAGTGC CCGACCCCCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCCCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGCCAGTCC CCGACCCTCA 
GAGCCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTCC CAGGCCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTCC CCGATCCACA 
GACTCAGTAC CTGACCCCCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTTC CAGACCCTCA 
GAGTCAGTTC CAGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
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2801 



2850 



vp 3w start 



42_2 
42_8 
42_15 
42_5b 
42_lb 
42_13 
42_3a 
42_4 
42_5a 
42_10 
42_3b 
42_11 
4 2_6b 
4 3_1 
4 3_5 
43_12 
4 3_20 
4 3_21 
4 3_23 
4 3_25 
44_1 
44_5 
223_10 
223^2 
223_4 
223_5 
223^6 
223J7 
A3_4 
A3_5 
A3_7 
A3_3 
4 2_12 
AAV1 
AAV2 
AAV 3 
AAV 8 
AAV 9 
AAV 7 
44 2 



ACCTCTCGGA GAACCTCCCG CCGCGCCCTC AGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGCACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCTCTCGGA GAACCTCCCG CCGCGCCCTC AGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 

ACCTCTCGGA GAACCTCCAG CAACCCCCGC TGCTGTGGGA CCTACTACAA 

GCCTCTCGGA CAGCCACCAG CAGCCCCCTC TGGTCTGGGA ACTAATACGA 

ACCTCTCGGA GAACCACCAG CAGCCCCCAC AAGTTTGGGA TCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCGCCCTC TGGTGTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCGCCCTC TGGTGTGGGA CCTAATACAA 

ACCTCTCGGA GAACCTCCAG CAGCGCCCTC TAGTGTGGGA TCTGGTACAG 

ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 




vp3 start 
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^p3 start codon 

4 2_2 TGGCTGCAGG CGGTGGCGCA 

4 2_8 TGGCTGCAGG CGGTGGCGCT 

42_15 TGGCTGCAGG CGGTGGCGCT 

42_5b TGGCTGCAGG CGGTGGCGCT 

4 2_lb TGGCTGCAGG CGGTGGCGCT 

4 2_13 TGGCTGCAGG CGGTGGCGCT 

4 2_3a TGGCTGCAGG CGGTGGCGCT 

42__4 TGGCTGCAGG CGGTGGCGCT 

4 2_5a TGGCTGCAGG CGGTGGCGCA 

4 2_10 TGGCTGCAGG CGGTGGCGCT 

4 2_3b TGGCTGCAGG CGGTGGCGCT 

42__11 TGGCTGCAGG CGGTGGCGCT 

4 2__6b TGGCTGCAGG CGGTGGCGCT 

4 3_1 TGGCTGCAGG CGGTGGCGCT 

43_5 TGGCTGCAGG CGGTGGCGCT 

4 3_12 TGGCTGCAGG CGGTGGCGCT 

43_20 TGGCTTCAGG CGGTGGCGCT 

4 3_21 TGGCTTCAGG CGGTGGCGCT 

4 3_23 TGGCTTCAGG CGGTGGCGCT 

4 3_25 TGGCTTCAGG CGGTGGCGCT 

4 4_1 TGGCTGCAGG CGGTGGCGCT 

4 4_5 TGGCTGCAGG CGGTGGCGCT 

223_10 TGGCTGCAGG CGGTGGCGCA 

223_2 TGGTTGCAGG CGGTGGCGCA 

223_4 TGGCTGCAGG CGGTGGCGCA 

223_5 TGGCTGCAGG CGGTGGCGCA 

223_6 TGGCTGCAGG CGGTGGCGCA 

223J7 TGGCTGCAGG CGGTGGCGCA 

A3_4 TGGCTTCAGG CGGTGGGGCA 

A3_5 TGGCTTCAGG CGGTGGGGCA 

A3J7 TGGCTTCAGG CGGTGGGGCA 

A3__3 TGGCTTCAGG CGGTGGGGCA 

42_12 TGGCTGCAGG CGGTGGCGCT 

AAV1 TGGCTTCAGG CGGTGGCGCA 

AAV 2 TGGCTACAGG CAGTGGCGCA 

AAV 3 TGGCTTCAGG CGGTGGCGCA 

AAV 8 TGGCTGCAGG CGGTGGCGCA 

AAV 9 TGGCTGCAGG CGGTGGCGCA 

AAV7 TGGCTGCAGG CGGTGGCGCA 

4 4_2 TGGCTGCAGG CGGTGGCGCT 
vp3 start codon (cont'< 



Fig. 1AAF 

2900 

CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAGCGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCAG ACGATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA GGGCGCCGAC 
CCAATGGCAG ACAATAACGA GGGTGCCGAT 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGTGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
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42 2 


GGAGTGGGTA 


ATGCCTCCGG 


42 8 


GGAGTGGGTA 


GTTCCTCAGG 


42 15 


GGAGTGGGTA 


GTTCCTCAGG 


42 5b 


GGAGTGGGTA 


GTTCCTCAGG 


42 ib 


GGAGTGGGTA 


GTTCCTCAGG 


42 13 


GGAGTGGGTA 


GTTCCTCAGG 


42 3a 


GGAGTGGGTA 


GTTCCTCAGG 


42 4 


GGAGTGGGTA 


ATGCCTCCGG 


42 5a 


GGAGTGGGTA 


ATGCCTCCGG 


■ 42 10 


GGAGTGGGTA 


ATGCCTCCGG 


42 3b 


GGAGTGGGTA 


ATGCCTCCGG 


42 11 


GGAGTGGGTA 


ATGCCTCCGG 


42 6b 


GGAGTGGGTA 


GTTCCTCAGG 


43 1 


GGAGTGGGTA 


GTTCCTCAGG 


43 5 


GGAGTGGGTA 


GTTCCTCAGG 


43 12 


GGAGTGGGTA 


GTTCCTCAGG 


43 20 


GGAGTGGGTA 


ATTCCTCGGG 


43 21 


GGAGTGGGTA 


ATTCCTCGGG 


43 23 


GGAGTGGGTA 


ATTCCTCGGG 


43 25 


GGAGTGGGTA 


ATTCCTCGGG 


44 1 


GGAGTGGGTA 


GTTCCTCAGG 


44 5 


GGAGTGGGTA 


GTTCCTCAGG 


223 10 


GGAGTGGGTA 


ATGCCTCAGG 


223 2 


GGAGTGGGTA 


ATGCCTCAGG 


223 4 


GGAGTGGGTA 


ATGCCTCAGG 


223 5 


GGAGTGGGTA 


ATGCCTCAGG 


223 6 


GGAGTGGGTA 


ATGCCTCAGG 


223 7 


GGAGTGGGTA 


ATGCCTCAGG 


A3 4 


GGAGTGGGTA 


ATTCCTCGGG 


A3 5 


GGAGTGGGTA 


ATTCCTCGGG 


A3 7 


GGAGTGGGTA 


ATTCCTCGGG 


A3 3 


GGAGTGGGTA 


ATTCCTCGGG 


42_12 


GGAGTGGGTA 


GTTCCTCAGG 


AAV1 


GGAGTGGGTA 


ATGCCTCAGG 


AAV 2 


GGAGTGGGTA 


ATTCCTCCGG 


AAV 3 


GGAGTGGGTA ATTCCTCAGG 


AAV 8 


GGAGTGGGTA 


GTTCCTCGGG 


AAV 9 


GGAGTGGGTA 


ATTCCTCGGG 


AAV 7 


GGAGTGGGTA 


ATGCCTCAGG 


AAV10 


GGTA 


ATTCCTCCGG 


AAV11 


GGTA 


ATTCCTCCGG 


AAV12 


GGTA 


ATTCCTCCGG 


44 2 


GGAGTGGGTA 


GTTCCTCAGG 



Fig. 1AAG 

2950 

AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATAGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CACGGCTGGG 
AAATTGGCAT TGCGATTCCA CACGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCC AATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
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2951 


4 2 2 


CGACAGAGTC 


42__8 


CGACAGAGTC 


42 15 


CGACAGAGTC 


4 2_5b 


CGACAGAGTC 


42_ lb 


CGACAGAGTC 


42 13 


CGACAGAGTC 


42_3a 


CGACAGAGTC 


42_4 


CGACAGAGTC 


42 5a 


CGACAGAGTC 


42__10 


CGACAGAGTC 


42__3b 


CGACAGAGTC 


42__11 


CGACAGAGTC 


42_6b 


CGACAGAGTC 


4 3__i 


CGACAGAGTC 


4 3 5 


TV TV TV f-|T ^1 

CGACAGAGTC 


4 3 12 


CGACAGAGTC 


43 20 


GGACAGAGTC 


43 21 


GGACAGAGTC 


43 23 


GGACAGAGTC 


43 25 


GGACAGAGTC 


4 4 1 


TV TV /~> TV m 

CGACAGAGTC 


44 5 


CGACAGAGTC 


223_10 


CGACAGAGTC 


223 2 


CGACAGAGTC 


223 4 


CGACAGAGTC 


223 5 


CGACAGAGTC 


223_6 


CGACAGAGTC 


223J7 


CGACAGAGTC 


A3 4 


CGACAGAGTT 


A3 5 


CGACAGAGTT 


A3_7 


CGACAGAGTT 


A3 3 


CGACAGAGTT 


42_12 


CGACAGAGTC 


AAV1 


CGACAGAGTC 


AAV 2 


CGACAGAGTC 


AAV 3 


CGACAGAGTC 


AAV 8 


CGACAGAGTC 


AAV 9 


GGACAGAGTC 


AAV 7 


CGACAGAGTC 


AAV10 


CGACAGAGTC 


AAV11 


CGACAGAGTC 


AAV 12 


CGACCGAGTC 


44 2 


CGACAGAGTC 



ATCACCACCA GCACCCGCAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGCAC 
ATCACCACCA GCACCCGCAC 
ATCACCACCA GCACCCGCAC 
ATCACCACCA GCACCCGCAC 
ATCACCACCA GCACCCGCAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACAAGAAC 
ATCACCACCA GCACAAGAAC 
ATCACCACCA GCACAAGAAC 
ATCACCACCA GCACAAGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGCAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCAGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATTACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATCACCACCA GCACCCGAAC 
ATTACCACCA GCACCCGGAC 
ATCACCACCA GCACCCGAAC 



3000 

CTGGGCCCTG CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
CTGGGCCTTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCCCTG CCCACTTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGCATTG CCCACCTACA 
CTGGGCCCTG CCCACCTACA 
CTGGGTCCTG CCCACCTACA 
CTGGGCCCTG CCAACCTACA 
TTGGGCCCTG CCCACCTACA 
CTGGGCCCTC CCCACCTACA 
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3001 



42 2 


ACAACCACCT 


CTACAAGCAG 


42 8 


ACAACCACCT 


CTACAAGCAA 


42 15 


ACAACCACCT 


CTACAAGCAA 


42 5b 


ACAACCACCT 


CTACAAGCAA 


42 lb 


ACAACCACCT 


CTACAAGCAA 


42 13 


ACAACCACCT 


CTACAAGCAA 


42~~3a 


ACAACCACCT 


CTACAAGCAA 


42 4 


ACAACCACCT 


CTACAAGCAG 


42 5a 


ACAACCACCT 


CTACAAGCAG 


42 10 


ACAACCACCT 


CTACAAGCAG 


42 3b 


ACAACCACCT 


CTACAAGCAG 


42 11 


ACAACCACCT 


CTACAAGCAG 


42 6b 


ACAACCACCT 


CTACAAGCAA 


43 1 


ACAACCATCT 


CTACAAGCAA 


43 5 


ACAACCATCT 


CTACAAGCAA 


43 12 


ACAACCATCT 


CTACAAGCAA 


43 20 


ACAACCACCT 


CTACAAGCAA 


43 21 


ACAACCACCT 


CTACAAGCAA 


43 23 


ACAACCACCT 


CTACAAGCAA 


43 25 


ACAACCACCT 


CTACAAGCAA 


44 1 


ACAACCACCT 


CTACAAGCAA 


44 5 


ACAACCACCT 


CTACAAGCAA 


223 10 


ACAACCACCT 


CTACAAGCAA 


223 2 


ACAACCACCT 


CTACAAGCAA 


223 4 


ACAACCACCT 


CTACAAGCAA 


223 5 


ACAACCACCT 


CTACAAGCAA 


223 6 


ACAACCACCT 


CTACAAGCAA 


223 7 


ACAACCACCT 


CTACAAGCAA 


A3 4 


ATAATCACCT 


CTACAAGCAA 


A3 5 


ATAATCACCT 


CTACAAGCAA 


A3 7 


ATAATCGCCT 


CTACAAGCAA 


A3 3 


ATAATCACCT 


CTACAAGCAA 


42_12 


ACAACCACCT 


CTACAAGCAA 


AAV1 


ATAACCACCT 


CTACAAGCAA 


AAV2 


ACAACCACCT 


CTACAAACAA 


AAV 3 


ACAACCATCT 


CTACAAGCAA 


AAV 8 


ACAACCACCT 


CTACAAGCAA 


AAV 9 


ACAACCACCT 


CTACAAGCAA 


AAV 7 


ACAACCACCT 


CTACAAGCAA 


AAV10 


ACAACCACAT 


CTACAAGCAA 


AAV11 


ACAACCACCT 


CTACAAACAA 


AAV12 


ACAACCACCT 


CTACAAGCAA 


44 2 


ACAACCACCT 


CTACAAGCAA 



3050 

ATATCAA. .G TCAGAGCGGG GCT . . . .ACC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATATCAA ... . GTCAGAGCG GGGC . . TACC 
ATATCAA ... . GTCAGAGCG GGGC . . TACC 
ATATCAA. .G TCAGAGCGGG GCT A . . . . CC 

ATATCAA . . G TCAGAGCGGG GCTA CC 

ATATCAA. .G TCAGAGCGGG GCTA. . . . CC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATCTCCAACG GGACATCGGG AGGAAGCACT 
ATCTCCAACG GGACATCGGG AGGAAGCACT 
ATCTCCAACG GGACATCGGG AGGAAGCACT 
ATCTCCAACG GCACCTCGGG AGGAAGCACC 
ATCTCCAACG GCACCTCGGG AGGAAGCACC 
ATCTCCAACG GCACCTCGGG AGGAAGCACC 
ATCTCCAACG GCACCTCGGG AGGAAGCACC 
ATCTCCAACG GGACTTCGGG AGGAAGCACC 
ATCTCCAACG GGACTTCGGG AGGAAGCACC 
ATCTCCAGTC AGTCAGCAGG GAG . . . CACC 
ATCTCCAGTC AGTCAGCAGG GAG . . . CACC 
ATCTCCAGTC AGTCAGCAGG GAG . ■. . CACC 
ATCTCCAGTC AGTCAGCAGG GAG. . .CACC 
ATCTCCAGTC AGTCAGCAGG GAG . . . CACC 
ATCTCCAGTC AGTCAGCAGG GAG . . . CACC 
ATCTCCA... GCGAATCGGG AGC . . . CACC 
ATCTCCA... GCGAATCGGG AGC ... CACC 
ATCTCCA. . . GCGAATCGGG AGC. . .CACC 
ATCTCCA. 1 . GCGAATCGGG AGC. . .CACC 
ATCTCCAACG GGACATCGGG AGGAAGCACC 
ATCTCCAGTG CTTCAACGGG . GG . . CCAGC 
ATTTCCA... GCCAATCAGG AGC . . . CTCG 
ATCTCCA. . . GCCAATCAGG AGC. . . TTCA 
ATCTCCAACG GGACATCGGG AGGAGCCACC 
ATCTCCAATG GAACATCGGG AGGAAGCACC 
ATCTCCAGTG AAACTGCAGG TAG... TACC 
ATCTCCAGCG AGACAGGAGC CACCAACGAC 
ATCTCCAGCG CTTCAACGGG GGCCAGCAAC 
ATCTCCAGCC AATCGGGTGC CACCAACGAC 
ATCTCCAACG GGACTTCGGG AGGAAGCACC 
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3051 




42 2 


AACGACAACC 


ACTTCTTCGG 


42 8 


AACGACAACA 


CCTACTTCGG 


42 15 


AACGACAACA 


CCTACTTCGG 


42 5b 


AACGACAACA 


CCTACTTCGG 


42 lb 


AACGACAACA 


CCTACTTCGG 


42 13 


AACGACAACA 


CCTACTTCGG 


42 3a 


AACGACAACA 


CCTACTTCGG 


42 4 


AACGACAACC 


ACTTCTTCGG 


4 2 5a ' 


AACGACAACC 


ACTTCTTCGG 


42 10 


AACGACAACC 


ACTTCTTCGG 


42 3b 


AACGACAACC 


ACTTCTTCGG 


42 11 


AACGACAACC 


ACTTCTTCGG 


42 6b 


AACGACAACA 


CCTACTTCGG 


43 1 


AACGACAACA 


CCTACTTTGG 


43 5 


AACGACAACA 


CCTACTTTGG 


43 12 


AACGACAACA 


CCTACTTTGG 


43 20 


AACGACAACA 


CCTATTTTGG 


43 21 


AACGACAACA 


CCTATTTTGG 


43 23 


AACGACAACA 


CCTATTTTGG 


43 25 


AACGACAACA 


CCTATTTTGG 


4 4 1 


AACGACAACA 


CCTACTTCGG 
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TCTATTTCGG 
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AACGATAACG 


TCTATTTCGG 
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AACGATAACG 


TCTATTTCGG 
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AACGATAACG 


TCTATTTCGG 
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AACGACAACC 


ACTACTTCGG 
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AACGACAACC 


ACTACTTCGG 


A3 7 


AACGACAACC 


ACTACTTCGG 
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AACGACAACC 


ACTACTTCGG 
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AACGACAACA 


CCTACTTCGG 


AAV1 


AACGACAACC 


ACTACTTCGG 


AAV 2 


AACGACAATC 


ACTACTTTGG 
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AACGACAACC 


ACTACTTTGG 
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AACGACAACA 


CCTACTTCGG 
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AACGACAACA 


CCTACTTTGG 
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AACGACAACA 


CCTACTTCGG 
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AACCACTACT 


TCGGCTACAG 
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ACTACTTTGG 
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AACCACTACT 


TCGGCTA. . . 


44 2 


AACGACAACA 


CCTACTTCGG 



Fig. 1AAJ 

3100 

CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGATTT 
CTACAGCACC CCTTGGGGGT ATTTTGACTT 
CTACAGCACC CCTTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 

C ACC CCCTGGGGGT ATTTTGACTT 

CTACAGCACC CCCTGGGGGT ATTTTGACTT 
. . . CAGCACC CCTTGGGGGT ATTTTGATTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
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TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
TAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
TAACAGATTC 
TAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
CAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
CAACAGATTC 
CAACAGATTC 
TAACAGATTC 
TAACAGATTC 
CAACAGATTC 
TAACAGATTC 
TAACAGATTC 
TAACAGATTC 
CAACAGATTC 
TAACAGATTC 



CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGTCACT 
CACTGTCACT 
CACTGTCACT 
CACTGTCACT 
CACTGCCACT 
CACTGCCACT 
CATTGCCACT 
CATTGCCACT 
CATTGCCACT 
CATTGCCACT 
CATTGCCACT 
CATTGCCACT 
CACTGTCACT 
CACTGTCACT 
CACTGTCACT 
CACTGTCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCACT 
CACTGCCATT 
CACTGCCACT 



Fig. 1AAK 



TCTCACCACG 
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TCTCACCACG 
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TCTCACCACG 
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TCTCACCACG 
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TTTCACCACG 
TCTCACCACG 
TTTCACCACG 
TCTCACCACG 
TCTCACCACG 
TTTCACCACG 
TCTCACCACG 
TCTCACCACG 
TCTCACCACG 



TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAA 
TGACTGGCAA 
TGACTGGCAA 
TGACTGGCAA 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAA 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 
TGACTGGCAG 



3150 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTTATCA 
CGACTTATCA 
CGACTTATCA 
CGACTTATCA 
CGACTTATCA 
CGACTTATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
AGACTCATCA 
CGACTCATTA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
CGACTCATCA 
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ACAACAACTG 


GGGATTCCGG 
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ACAACAACTG 


GGGATTCCGG 
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ACAACAACTG 


GGGATTCCGG 


42 5b 


ACAACAACTG 


GGGATTCCGG 
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GGGATTCCGG 
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GGGATTCCGG 


223 4 
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GGGATTCCGG 


223 5 
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GGGATTCCGG 


223 6 


ACAACAACTG 


GGGATTCCGG 


223 7 


ACAACAACTG 


GGGATTCCGG 


A3 4 


ACAACAACTG 


GGGATTTAGA 


A3 5 


ATAACAACTG 


GGGATTTAGA 


A3 7 


ACAACAACTG 


GGGATTTAGA 


A3 3 


ACAACAACTG 


GGGATTTAGA 


42 12 


ACAACAACTG 


GGGATTCCGG 


AAV1 


ACAACAATTG 


GGGATTCCGG 


AAV 2 


ACAACAACTG 


GGGATTCCGA 


AAV 3 


ACAACAACTG 


GGGATTCCGG 


AAV8 


ACAACAACTG 


GGGATTCCGG 


AAV 9 


ACAACAACTG 


GGGATTCCGG 


AAV 7 


ACAACAACTG 


GGGATTCCGG 


AAV10 
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GGGATTC 


AAV 11 
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GGGATTC 


AAV12 
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GGGATTC 


44 2 


ACAACAACTG 


GGGATTCCGG 



Fig. 1AAL 

3200 

CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAAAGAC TCAACTTCAA GCTGTTCAAC 
CCCAAAAGAC TCAACTTCAA GCTGTTCAAC 
.CCCAAAAGAC TCAACTTCAA GCTGTTCAAC 
CCCAAAAGAC TCAACTTCAA GCTGTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC CCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC - 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA ACTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTTAAC 
CCCAAGAAAC TCAGCTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAGCTTCAA GCTCTTCAAC 
CCAAAGAGAC TCAACTTCAA GCTGTTCAAC 
CCCAAGAAGC TGCGGTTCAA GCTCTTCAAC 



CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
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ATCCAGGTCA 
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ATCCAGGTCA 


42 10 


ATCCAGGTCA 
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ATCCAGGTCA 
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ATCCAGGTCA 


42 6b 


ATCCAGGTCA 
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ATCCAGGTCA 


43 5 


ATCCAGGTCA 


43 12 


ATCCAGGTCA 
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ATCCAGGTCA 
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ATCCAGGTCA 
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ATCCAGGTCA 
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ATCCAGGTCA 


44 1 


ATCCAGGTCA 


44_5 


ATCCAGGTCA 
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ATCCAGGTCA 
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ATCCAGGTCA 


223 4 


ATCCAGGTCA 
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ATCCAGGTCA 


223 6 


ATCCAGGTCA 
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ATCCAGGTCA 


A3 4 


ATCCAAGTCA 


A3 5 


ATCCAAGTCA 


A3 7 


ATCCAAGTCA 


A3 3 


ATCCAAGTCA 
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ATCCAGGTCA 
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ATCCAAGTCA 
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ATCCAGGTCA 


44 2 


ATCCAGGTCA 



Fig. 1AAM 



AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGAACGAC 
AGGAGGTCAC GACGGACGAC 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAAGTCAC GACGAACGAA 
AGGAAGTCAC GACGAACGAA 
AGGAAGTCAC GACGAACGAA 
AGGAAGTCAC GACGAACGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAT 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTCAC GACGAATGAT 
AAGAGGTCAC GCAGAATGAC 
GAGGGGTCAC GCAGAACGAT 
AGGAGGTCAC GCAGAATGAA 
AGGAGGTTAC GACGAACGAA 
AGGAGGTCAC GACGAATGAC 
AGGAGGTCAC GCAGAATGAA 



3250 

GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGTGTCACAA CCATCGCTAA 
GGTGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGTGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTCACAA CCATCGCTAA 
GGTACGACGA CGATTGCCAA 
GGCACGACGA CTATTGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
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42 2 


TAACCTTACC 


42 8 


TAACCTTACC 
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TAACCTTACC 


42 5b 
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TAACCTTACC 
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TAACCTTACC 
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TAACCTTACC 
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TAACCTTACC 
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TAACCTTACC 
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TAATCTCACC 
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TAATCTCACC 
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TAATCTCACC 
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TAATCTCACC 
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TAACCTTACC 


44 5 


TAACCTTACC 
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TAACCTTACC 


223 2 


TAACCTTACC 


223 4 


TAACCTTACC 


223 5 


TAACCTTACC 


223 6 


TAACCTTACC 


223 7 


TAACCTTACC 


A3 4 


TAACCTTACC 


A3 5 


TAACCTTACC 


A3 7 


TAACCTTACC 


A3 3 


TAACCTTACC 


42 12 


TAACCTTACC 


AAV1 


TAACCTTACC 
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AAV 3 
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AAV 8 
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AAV 9 
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44 2 


TAACCTTACC 



Fig. 1AAN 



AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC. AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTCTTCTC 
AGCACGATTC AGGTGTTTAC 
AGCACGATTC AGGTGTTTAC 
AGCACGATTC AGGTGTTTAC 
AGCACCGTGC AGGTCTTTAC 
AGCACCGTGC GGGTCTTTAC 
AGCACCGTGC AGGTCTTTAC 
AGCACCGTGC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGATTC AGGTCTTTAC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTTC AGGTCTTTTC 
AGCACGGTGC AGGTCTTCAC 
AGCACGGTGC AGGTCTTCAC 
AGCACGGTGC AGGTCTTCAC 
AGCGCGGTGC AGGTCTTCAC 
AGCACGATTC AGGTCTTTAC 
AGCACGGTTC AAGTCTTCTC 
AGCACGGTTC AGGTGTTTAC 
AGCACGGTTC AAGTGTTTAC 
AGCACCATCC AGGTGTTTAC 
AGCACCGTCC AGGTCTTTAC 
AGCACGATTC AGGTATTCTC 
AGCACGATTC AGGTCTTTAC 



3300 

GGACTCGGAG TACCAACTGC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCGGCTCC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAG TACCAACTGC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAG TACCAGTTAC 
GGACTCGGAG TACCAGTTAC 
GGACTTGGAG TACCAGTTAC 
GGACTCGGAG TACCAGTTAC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACTCGGAA TATCAACTGC 
GGACCCGGAA TATCAACTGC 
AGACTCTGAG TACCAGCTGC 
AGACTCTGAG TACCAGCTGC 
AGACTCTGAG TACCAGCTGC 
AGACTCTGAG TACCAGCTGC 
GGACTCGGAA TACCAGCTCC 
GGACTCGGAG TACCAGCTTC 
TGACTCGGAG TACCAGCTCC 
GGACTCGGAG TATCAGCTCC 
GGACTCGGAG TACCAGCTGC 
GGACTCGGAG TACCAGCTAC 
GGACTCGGAA TACCAGCTGC 
GGACTCGGAA TACCAGCTCC 
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42 2 


CGTACGTCCT 


42 8 


CGTACGTCCT 


42 15 


CGTACGTCCT 


42 5b 


CGTACGTCCT 


42 lb 


CGTACGTCCT 


42 13 


CGTACGTCCT 
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CGTACGTCCT 


42_4 


CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCC 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTGCT 
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CGTACGTGCT 
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CGTACGTGCT 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCT 
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CGTACGTCCT 


A3 7 
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A3 3 


CCTACGTCCT 
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CGTACGTCCT 


AAV1 


CGTACGTCCT 


AAV 2 


CGTACGTCCT 


AAV3 


CGTACGTGCT 


AAV 8 


CGTACGTTCT 
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CGTACGTCCT 


AAV 7 


CGTACGTCCT 


44 2 


CGTACGTCCT 



Fig. 1AAO 



CGGCTOTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
AGGATCCGCT CACCAGGGAT 
AGGATCCGCT CACCAGGGAT 
AGGATCCGCT CACCAGGGAT 
AGGATCCGCT CACCAGGGAT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCGGCG CATCAAGGAT 
CGGGTCGGCG CACCAAGGCT 
CGGCTCTGCC CACCAGGGCT 
AGGCTCTGCC CACCAAGGAT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
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GCCTCCCTCC GTTCCCTGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCCGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG ' 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTGCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCGGCG 
GCCTCCCGCC GTTCCCAGCA 
GTCTCCCGCC GTTTCCAGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCACC GTTTCCTGCA 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
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42 2 


GACGTGTTCA 


TGATTCCTCA 


42 8 


GACGTCTTCA 


TGATTCCTCA 


42_15 


GACGTCTTCA 


TGATTCCTCA 


42_5b 


GACGTCTTCA 


TGATTCCTCA 


42__lb 


GACGTCTTCA 


TGATTCCTCA 


42 13 


GACGTCTTCA 


TGATTCCTCA 


42 3a 


GACGTCTTCA 


TGATTCCTCA 


a r\ a 

42 4 


GACGTCTTCA 


TGATTCCTCA 


42 5a 


GACGTGTTCA 


TGATTCCTCA 


42 10 


GACGTGTTCA 


TGATTCCTCA 


42_3b 


GACGTGTTCA 


TGATTCCTCA 


42 1 


GACGTGTTCA 


TGATTCCTCA 


42_6b 


GACGTGTTCA 


TGATTCCTCA 


43_1 


GACGTCTTCA 


TGATTCCTCA 


43_5 


GACGTCTTCA 


TGATTCCTCA 


4 3_12 


GACGTCTTCA 


TGATTCCTCA 


43_20 


GACGTCTTCA 


CGGTTCCTCA 


43_21 ' 


GACGTCTTCA 


TGGTTCCTCA 


43J23 


GACGTCTTCA 


TGGTTCCTCA 


43_25 


GACGTCTTCA 


TGGTTCCTCA 


4 4_1 


GACGTCTTCA 


TGATTCCTCA 


44 5 


GACGTCTTCA 


TGATTCCTCA 


223 10 


GACGTGTTCA 


TGATTCCGCA 


223_2 


GACGTGTTCA 


TGATTCCGCA 


223_4 


GACGTGTTCA 


TGATTCCGCA 


223__5 


GACGTGTTCA 


TGATTCCGCA 


223_6 


GACGTGTTCA 


TGATTCCGCA 


223J7 


GACGTGTTCA 


TGATTCCGCA 


A3 4 


GACGTCTTCA 


TGATTCCTCA 


A3 5 


GACGTCTTCA 


TGATTCCTCA 


A3 7 


GACGTCTTCA 


TGATTCCTCA 


A3 3 


GACGTCTTCA 


TGATTCCTCA 


42JL2 


GACGTCTTCA 


TGATTCCTCA 


AAV1 


GACGTGTTCA 


TGATTCCGCA 


AAV2 


GACGTCTTCA 


TGGTGCCACA 


AAV 3 


GACGTCTTCA 


TGGTCCCTCA 


AAV 8 


GACGTGTTCA 


TGATTCCCCA 


AAV 9 


GACGTCTTCA 


TGGTTCCTCA 


AAV 7 


GACGTCTTCA 


TGATTCCTCA 


44 2 


GACGTCTTCA 


TGATTCCTCA 



Fig. 1AAP 
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GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGGTAT CTGACCCTAA ACAATGGCAG 
GTACGGGTAT CTGACCCTAA ACAATGGCAG 
GTACGGGTAT CTGACCCTAA ACAATGGCAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGGTAC CTGACTCTGA ACAATGGCAG 
GTACGGGTAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
ATACGGCTAC CTGACGCTCA ACAATGGCAG 
GTATGGATAC CTCACCCTGA ACAACGGGAG 
GTATGGATAC CTCACCCTGA ACAACGGAAG 
GTACGGCTAC CTAACACTCA ACAACGGTAG 
GTACGGCTAC CTGACGCTCA ACAATGGAAG 
GTACGGCTAC CTGACTCTCA ACAATGGCAG 
GTACGGGTAC CTGACTCTGA ACAATGGCAG 
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42 2 


TCAGTCTGTG 


42 8 


TCAGGCCGTG 


42 15 


TCAGGCCGTG 


42 5b 


TCAGGCCGTG 


42 lb 


TCAGGCCGTG 


42 13 


TCAGGCCGTG 


42 3a 


TCAGGCCGTG 


42 4 


TCAGGCCGTG 


42 5a 


TCAGTCTGTG 


42 10 


TCAGTCTGTG 


42 3b 


TCAGTCTGTG 


42 11 


TCAGTCTGTG 


42 6b 


TCAGTCTGTG 


43 1 


TCAGGCTGTG 


43 5 


TCAGGCTGTG 


43 12 


TCAGGCTGTG 


43 20 


CCAAGCCCTG 


43 21 


CCAAGCCCTG 


43 23 


CCAAGCCCTG 


43 25 


CCAAGCCCTG 


44 1 


TCAGGCCGTG 


44 5 


TCAGGCCGTG 


223 10 


CCAATCGGTA 


223 2 


CCAATCGGTA 


223 4 


CCAATCGGTA 


223 5 


CCAATCGGTA 


223 6 


CCAATCGGTA 


223 7 


CCAATCGGTA 


A3 4 


CCAAGCGGTA 


A3 5 


CCAAGCGGTA 


A3 7 


CCAAGCGGTA 


A3 3 


CCAAGCGGTA 


42_12 


TCAGGCCGTG 


AAV1 


CCAAGCCGTG 


AAV 2 


TCAGGCAGTA 


AAV 3 


TCAAGCGGTG 


AAV 8 


TCAGGCCGTG 


AAV 9 


TCAAGCGTTA 


AAV 7 


TCAGTCTGTG 


44 2 


TCAGGCCGTG 



Fig. 1AAQ 



GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCTT CATTCTACTG 
GGACGTTCTT CATTCTACTG 
GGACGTTCTT CATTCTACTG 
GGACGTTCTT CATTCTACTG 
GGCCGTTCCT CCTTCTACTG 
GGACGTTCAT CCTTTTACTG 
GGACGCTCTT CATTTTACTG 
GGACGCTCAT CCTTTTACTG 
GGACGCTCCT CCTTCTACTG 
GGACGTTCTT CTTTCTACTG 
GGACGTTCCT CCTTCTACTG 
GGCCGTTCCT CCTTCTACTG 
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CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAATAC TTCCCTTCTC 
CCTGGAATAC TTCCCTTCTC 
CCTGGAATAC TTCCCTTCTC 
TCTGGAGTAT TTCCCATCGC 
TCTGGAGTAT TTCCCATCGC 
TCTGGAGTAT TTCCCATCGC 
TCTGGAGTAT TTCCCATCGC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
TCTAGAGTAT TTTCCCTCTC 
TCTAGAGTAT TTTCCCTCTC 
TCTAGAGTAT TTTCCCTCTC 
TCTAGAGTAT TTTCCCTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAATAT TTCCCTTCTC 
CCTGGAGTAC TTTCCTTCTC 
CCTGGAGTAC TTCCCTTCGC 
CCTGGAATAC TTTCCTTCGC 
TCTGGAATAC TTCCCTTCTC 
CCTGGAGTAC TTCCCCTCTC 
CCTGGAGTAC TTTCCTTCTC 
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42 2 


AGATGCTGAG 


42 8 


AAATGCTGAG 


42 15 


AAATGCGGAG 


42 5b 


AAATGCTGAG 


42 lb 


AAATGCTGAG 


42 13 


AAATGCTGAG 


42 3a 


AAATGCTGAG 


42 4 


AAATGCTGAG 


42_5a 


AGATGCTGAG 


42 10 


AGATGCTGAG 


42 3b 


AGATGCTGAG 


42 11 


AGATGCTGAG 


42 6b 


AGATGCTGAG 


43 1 


AAATGCTGAG 


43 5 


AAATGCTGAG 


43 12 


AAATGCTGAG 


43 20 


AGATGCTGAG 


43 21 


AGATGCTGAG 


43 23 


AGATGCCGAG 


43 25 


AGATGCTGAG 


44 1 


AAATGCTGAG 


4 4 5 


AAATGCTGAG 


223 10 


AGATGCTGAG 


223 2 


AGATGCTGAG 


223 4 


AGATGCTGAG 


223 5 


AGATGCTGAG 


223 6 


AGATGCTGAG 


223 7 


AGATGCTGAG 


A3 4 


AGATGCTGAG 


A3 5 


AGATGCTGAG 


A3 7 


AGATGCTGAG 


A3 3 


AGATGCTGAG 


42_12 


AAATGCTGAG 


AAV1 


AGATGCTGAG 


AAV 2 


AGATGCTGCG 


AAV3 


AGATGCTAAG 


AAV 8 


AGATGCTGAG 


AAV 9 


AGATGCTGAG 


AAV7 


AGATGCTGAG 


44 2 


AAATGCTGAG 



Fig. 1AAR 



AACGGGCAAT AACTTTGAAT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
AACCGGCAAC AACTTTCAGT 
AACCGGCAAC AACTTTCAGT 
AACCGGCAAC AACTTTCAGT 
AACCGGCAAC AACTTTCAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTACCT 
TACCGGAAAC AACTTTACCT 
GACTGGAAAT AACTTCCAAT 
AACCGGCAAC AACTTCCAGT 
AACCGGCAAC AACTTTCAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 



3500 

TCAGCTACAC CTTTGAGGAA 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC TTTTGAGGAC 
TCAGCTATAC CTTCGAGGAT 
TTACTTACAC CTTCGAGGAC 
TCAGCTACAC TTTCGAGGAC 
TCAGCTACAG CTTCGAGGAC 
TCAGCTACCA GTTTGAGGAC 
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42 2 


GTGCCTTTCC 


42 8 


GTGCCTTTTC 


42 15 


GTGCCTTTTC 


42 5b 


GTGCCTTTTC 


42 lb 


GTGCCTTTTC 


42 13 


GTGCCTTTTC 


42 3a 


GTGCCTTTTC 


42 4 


GTGCCTTTTC 


42 5a 


GTGCCCTTTC 


42 10 


GTGCCTTTCC 


42~ 3b 


GTGCCTTTCC 


42 11 


GTGCCTTTCC 


42 6b 


GTGCCTTTCC 


43 1 


GTGCCTTTCC 


43 5 


GTGCCTTTCC 


43 12 


GTGCCTTTCC 


43 20 


GTGCCTTTCC 


43 21 


GTGCCTTTCC 


43 23 


GTGCCTTTCC 


43 25 


GTGCCTTTCC 


44 1 


GTGCCTTTTC 


44 5 


GTGCCTTTTC 


223 10 


GTGCCTTTCC 


223 2 


GTGCCTTTCC 


223 4 


GTGCCTTTCC 


223 5 


GTGCCTTTCC 


223 6 


GTGCCTTTCC 


223 7 


GTGCCTTTCC 


A3 4 


GTGCCTTTCC 


A3 5 


GTGCCTTTCC 


A3 7 


GTGCCTTTCC 


A3 3 


GTGCCTTTCC 


42__12 


GTGCCTTTTC 


AAV1 


GTGCCTTTCC 


AAV2 


GTTCCTTTCC 


AAV3 


GTACCTTTTC 


AAV 8 


GTGCCTTTCC 


AAV 9 


GTGCCTTTCC 


AAV7 


GTGCCTTTCC 


44 2 


GTGCCTTTTC 



Fig. 1AAS 



ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCATAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCATAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCTCACAGC 
ACAGCAGCTA CGCTCACAGC 
ACAGCAGCTA CGCCCACAGC 
ACAGCAGCTA CGCACACAGC 
ACAGCAGCTA CGCACACAGC 
ACAGCAGCTA CGCGCACAGC 
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CAGAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG GCCGGCTGAT 
CAGAGTCTGG GCCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAAAGCCTGG ACCGGCTGAC 
CAGAGCCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGTCTCAT 
CAGAGTTTGG ATCGCTTGAT 
CAGAGCTTGG ACCGGCTGAT 
CAGAGTCTAG ATCGACTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
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42 2 


GAATCCCCTC 


ATCGACCAGT 


42 8 


GAACCCCCTC 


ATCGACCAGT 


42 15 


GAACCCCCTC 


ATCGACCAGT 


42 5b 


GAACCCCCTC 


ATCGACCAGT 


42 lb 


GAACCCCCTC 


ATCGACCAGT 


42 13 


GAACCCCCTC 


ATCGACCAGT 


42 3a 


GAACCCCCTC 


ATCGACCAGT 


42 4 


GAACCCCCTC 


ATCGACCAGT 


42 5a 


GAACCCCCTC 


ATCGACCAGT 


42 10 


GAATCCCCTC 


ATCGACCAGT 


42 3b 


GAATCCCCTC 


ATCGACCAGT 


42 11 


GAATCCCCTC 


ATCGACCAGT 


42 6b 


GAATCCCCTC 


ATCGACCAGT 


43 1 


GAACCCTCTC 


ATCGACCAGT 


43 5 


GAACCCTCTC 


ATCGACCAGT 


43 12 


GAACCCTCTC 


ATCGACCAGT 


43 20 


GAATCCCCTC 


ATCGACCAGT 


43 21 


GAATCCCCTC 


ATCGACCAGT 


43 23 


GAATCCCCTC 


ATCGACCAGT 


43 25 


GAATCCCCTC 


ATCGACCAGT 


44 1 


GAACCCCCTC 


ATCGACCAGT 


44 5 


GAACCCCCTC 


ATCGACCAGT 


223 10 


GAATCCCCTC 


ATCGACCAGT 


223 2 


GAATCCCCTC 


ATCGACCAGT 


223 4 


GAATCCCCTC 


ATCGACCAGT 


223 5 


GAATCCCCTC 


ATCGACCAGT 


223 6 


GAATCCCCTC 


ATCGACCAGT 


223 7 


GAATCCCCTC 


ATCGACCAGT 


A3 4 


GAATCCTCTC 


ATTGACCAGT 


. A3 5 


GAATCCTCTC 


ATTGACCAGT 


A3 7 


GAATCCTCTC 


ATTGACCAGT 


A3 3 


GAATCCTCTC 


ATTGACCAGT 


4 2_12 


GAACCCCCTC 


ATCGACCAGT 


AAV1 


GAATCCTCTC 


ATCGACCAAT 


AAV2 


GAATCCTCTC 


ATCGACCAGT 


AAV 3 


GAATCCTCTT 


ATTGATCAGT 


AAV 8 


GAATCCTCTG 


ATTGACCAGT 


AAV 9 


GAACCCCCTC 


ATCGACCAGT 


AAV 7 


GAATCCCCTC 


ATCGACCAGT 


44 2 


GAACCCCCTC 


ATCGACCAGT 



Fig. 1AAT 
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ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTATTA CTTATCCAGA ACTCAGTCCA 
ACCTGTATTA CTTATCCAGA ACTCAGTCCA 
ACCTGTATTA CTTATCCAGA ACTCAGTCCA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGGTCAGA ACGCAAACGA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTACTA CTTGGCCAGA ACACAGAGCA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTATTA CCTGAGCAAA ACTCAGGGTA 
ACCTGTACTA CCTGGCCCGG ACCCAGAGCA 
ACCTGTATTA CCTGAACAGA ACTCAAA . AT 
ACCTGTATTA CTTGAGCAGA ACAAACACTC 
ATCTGTACTA CCTGAACAGA ACGCAAGGAA 
ACCTGTACTA CTTGTCTCGG ACTCAAACAA 
ACCTATACTA CCTGGTCAGA ACACAGACAA 
ACTTGTACTA CCTGGCCAGA ACACAGAGTA 
ACCTGTACTA CCTGTCTCGG ACTCAGTCCA 
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42 2 


CTACGG . 


.GG 


42 8 


CGGGA. 


-GG 


42 15 


CGGGA. 


.GG 


4 2 5b 


CGGGA. 


.GG 


42 lb 


CGGGA. . 
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CGGGA. . 


.GG 


4 2 3a 


CGGGA. . 


.GG 


42 4 
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TCCACAAGGG AGCTGCA. GT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
GTCCACAAGG GAGCTGCAGT 
GTCCACAAGG GAGCTGCAGT 
GTCCACAAGG GAGCTGCAGT 
GTCCACAAGG GAGCTGCAGT 
AACTCAAGGT ACTCAGCAAT 
AACTCAAGGT ACTCAGCAAT 
AACTCAAGGT ACTCAGCAAT 
AACTGGAGGG ACGCAGACTC 
AACTGGAGGG ACGCAGACTC 
AACTGGAGGG ACGCAGACTC 
AACTGGAGGG ACGCAGACTC 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
AACAACGCAG CAATCGAGAC 
AACAACGCAG CAATCGAGAC 
AACAACGCAG CAATCGAGAC 
AACAACGCAG CAATCGAGAC 
GTCCACAAGG GGGCTGCAGT 
AAGTGCCCAA AACAAGGACT 
AACCACCACG CAGTCAAGGC 
AACAACCAAC CAATCACGGC 
. ACGGCAAAT ACGCAGACTC 
. ACTGGGGGA ACTCAAACTT 
CACAGCTGGC AATCGGGAAC 
TACCGCAGGA ACTCAGCAGT 
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TCCA TCAGGCTGGG 

TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 



TCCA TCAGGCTGGG 

TCCA TCAGGCTGGG 

TCCA TCAGGCTGGG 

TCCA TCAGGCTGGG 



TGTTATTTTC TCAAGCCGGG 
TGTTATTTTC TCAAGCCGGG 
TGTTATTTTC TCAAGCCGGG 
TGGCATTCAG CCAAGCGGGT 
TGGCATTCAG CCAAGCGGGT 
TGGCATTCAG CCAAGCGGGT 
TGGCATTCAG CCAAGCGGGT 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTCAG CCAAGCTGGG 
TGCAGTTCAA CCAAGCTGGG 
TGCAGTTCAG CCAAGCTGGG 
TGCAGTTCAG CCAAGCTGGG 

TCCA TCAGGCTGGG 

TGCTGTTTAG CCGTGGGTCT 
TTCAGTTTTC TCAGGCCGGA 
TGCTTTTTAG CCAGGCTGGG 
TGGGCTTCAG CCAAGGTGGG 
TGGCATTCAG CCAAGCAGGC 
TGCAGTTTTA CCAGGGCGGG 
TGCTATTTTC TCAGGCCGGG 
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ATCAAAGAAC TGGCTGCCCG GACCCTGTTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
ATCAAAGAAC TGGCTGCCCG GACCCTGTTA 
ATCAAAGAAC TGGCTGCCCG GACCCTGTTA 
ATCAAAGAAC TGGCTGCCCG GACCCTGTTA 
ATCAAAGAAC TGGCTGCCCG GACCCTGTTA 
GGCCAAGAAC TGGCTACCTG GACCGTGTTA 
GGCCAAGAAC TGGCTACCTG GACCGTGTTA 
GGCCAAGAAC TGGCTACCTG GACCGTGTTA 
GGCTAGAAAT TGGGTGCCCG GACCTTGCTA 
GGCTAGAAAT TGGGTGCCCG GACCTTGCTA 
GGCTAGAAAT TGGGTGCCCG GACCTTGCTA 
GGCTAGAAAT TGGGTGCCCG GACCTTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
AGCAAAGAAC TGGCTGCCCG GACCTTGCTT 
AGCAAAGAAC TGGCTGCCCG GACCTTGCTT 
AGCAAAGAAC TGGCTGCCCG GACCTTGCTT 
AGCAAAGAAC TGGCTGCCCG GACCTTGCTT 
AGCAAAGAAC TGGCTGCCCG GACCTTGCTT 
AGCAAAGAAC TGGCTGCCCG GACCTTGCTT 
GGCCAAAAAC TGGCTACCGG GACCCAGCTA 
GGCCAAAAAC TGGCTACCGG GACCCAGCTA 
GGCCAAAAAC TGGCTACCGG GACCCAGCTA 
GGCCAAAAAC TGGCTACCGG GACCCAGCTA 
ATCAAAGAAC TGGCTGCCCG GACCCTGTTA 
GCCCAAAAAC TGGCTACCTG GACCCTGTTA 
GTCTAGGAAC TGGCTTCCTG GACCCTGTTA 
GGCCAGAAAT TGGCTACCTG GGCCCTGCTA 
GGCAAAGAAC TGGCTGCCAG GACCCTGTTA 
GGCTAGAAAC TGGGTACCCG GGCCTTGCTA 
AGCCAAGAAT TGGTTACCTG GACCTTGCTT 
GGCCAAAAAC TGGCTACCCG GGCCCTGCTA 
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AGACTGTCAA AAAACATAGA 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACAGTGTC 
CGCGTCTCCA CGACAGTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
AGACTGTCAA AAAACATAGA 
AGACTGTCAA AAAACATAGA 
AGACTGTCAA AAGACATAGA 
AGACTGTCAA AAAACATAGA 
CGAGTTTCCA CGACACTGTC 
CGAGTTTCCA CGACACTGTC 
CGAGTTTCCA CGACACTGTC 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
AGACTGTCAA AAAACATAGA 
CGCGTTTCTA AAACAAAAAC 
CGAGTATCAA AGACATCTGC 
AGACTTTCAA AGACTGCTAA 
CGCGTCTCAA CGACAACCGG 
CGCGTCTCCA CAACCACCAA 
AGAGTCTCCA AAACGCTGGA 
CGCGTCTCCA CGACACTGTC 
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CAGCAACAAC AACAGTAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
CAGCAACAAC AACAGTAACT 
CAGCAACAAC ACCAGTAACT 
CAGCAACAAC AACAGTAACT 
CAGCAACAAC AACAGTAACT 
GCAAAACAAC AACAGCAATT 
GCAAAACAAC AACAGCAATT 
GCAAAACAAC AACAGCAATT 
CCAGAACAAC AACAGCAACT 
CCAGAGCAAC AACAGCAACT 
CCAGAACAAC AACAGCAACT 
CCAGAACAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
CAGCAACAAC AACAGTAACT 
AGACAACAAC AACAGCAATT 
GGATAACAAC AACAGTGAAT 
CGACAACAAC AACAGTAACT 
GCAAAACAAC AATAGCAACT 
CCAAAATAAC AACAGCAACT 
TCAAAACAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
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3800- 

AAATACCATC TGAATGGTAG AAATTCATTA 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAGTATCACC TGAATGGCAG AGACTCCCTG 
AAGTATCACC TGAATGGCAG AGACTCCCTG 
AAGTATCACC TGAATGGCAG AGACTCCCTG 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAATACCATT TAAATGNAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATATTACC TGAATGGAAG AAATTCTCTG 
AAATATTACC CGAATGGAAG AAATTCTCTG 
AAATATTACC TGAATGGAAG AAATTCTCTG 
AAATATTACC TGAATGGAAG AAATTCTCTG 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATATAACC TCAATGGGCG TGAATCCATC 
AAGTACCACC TCAATGGCAG AGACTCTCTG 
AAATATCATC TCAATGGCCG CGACTCGCTG 
AAATACCATC TGAATGGAAG AAATTCATTG 
AAATTCAAGC TGAACGGGAG AGACTCGCTA 
AAATATCACC TGAACGGCAG AAACTCGTTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
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AGGACCAGTT 
AAGAGCGATT 
AAGAGCGATT 
AAGAGCGATT 
AAGAGCGATT 
AAGAGCGATT 
AAGAGCGATT 
AAGAGCGATT 
AAGAGCGATT 
AGGACCAGTT 
AGGACCAGTT 
AGGACCAGTT 
AGGACCAGTT 
AGGAGCGCTT 
AGGAGCGCTT 
AGGAGCGCTT 
ACGACCGCTT 
ACGACCGCTT 
ACGACCGCTT 
ACGACCGCTT 
AAGAGCGATT 
AAGAGCGATT 
AGGAACGCTT 
AGGAACGCTT 
AGGAACGCTT 
AGGAACGCTT 
AGGAACGCTT 
AGGAACGCTT 
AGGAAAAGTA 
AGGAAAAGTA 
AGGAAAAGTA 
AGGAAAAGTA 
AGGACCAGTT 
AAGACAAGTT 
AAGAAAAGTT 
AAGAAAAATT 
AGGAGCGTTT 
AGGACCGCTT 
AGGACCGCTT 
AAGAGCGATT 
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Fig. 1AAZ 

3900 

TGGTTTTTGG CGAAACGGGG GCTGCCAACA 
TGATGTTTGG GAAACAGGGA GCTGGAAA . 
TGATGTTTGG GAAACAGGGA GCTGGAAA 
TGATGTTTGG GAAACAGGGA GCTGGAAA. 
TGATGTTTGG GAAACAGGGA GCTGGAAA 
TGATGTTTGG GAAACAGGGA GCTGGAAA. 
TGATGTTTGG GAAACAGGGA GCTGGAAA. 
TGATGTTTGG GAAACAGGGA GCTGGAAA 
TGATGTTTGG GAAACAGGGA GCTGGAAA 
TGGTTTTTGG CAAAACGGGG GCTGCCAACA 
TGGTTTTTGG CAAAACGGGG GCTGCCAACA 
TGGTTTTTGG CAAAACGGGG GCTGCCAACA 
TGGTTTTTGG CAAAACGGGG GCTGCCAACA 
TAATGTTTGG CAAGCAGGGG GCTGGAAA 
TAATGTTTGG CAAGCAGGGG GCTGGAAA.. 
TAATGTTTGG CAAGCAGGGG GCTGGAAA 
TGATTTTTGG CAAGCAAGGA GCCGGGAA 
TGATTTTTGG CAAGCAAGGA GCCGGGAA 
TGATTTTTGG CAAGCAAGGA GCCGGGAA. 
TGATTTTTGG CAAGCAAGGA GCCGGGAA.. 
TAATGTTTGG GAAACAGGGA GCTGGAAA 
TAATGTTTGG GAAACAGGGA GCTGGAAA 
TAATTTTTGG CAAAACTGGA GCAGCTAATA 
TAATTTTTGG CAAAACTGGA GCAGCTAATA 
TAATTTTTGG CAAAACTGGA GCAGCTAATA 
TAATTTTTGG CAAAACTGGA GCAGCTAATA 
TAATTTTTGG CAAAACTGGA GCAGCTAATA 
TAATTTTTGG CAAAACTGGA GCAGCTAATA 
TCATCTTTGG AAAACAAGGC ACAGGAAC . 
TCATCTTTGG AAAACAAGGC ACAGGAAC. 
TCATCTTTGG • AAAACAAGGC ACAGGAAC. 
TCATCTTTGG AAAACAAGGC ACAGGAAC 
TGGTTTTTGG CAAAACGGGG GCTGCCAACA 
TGATTTTTGG AAAAGAGAGC GCCGGAGC 
TCATCTTTGG GAAGCAAGGC TCAGAGAA 
TAATATTTGG CAAAGAAGGG ACAACGGC. 
TGATTTTTGG CAAACAAAAT GCTGCCAG 
TCATATTTGG CAAGCAAGGA GCCGGGAA* 
TGATTTTTGG AAAAACTGGA GCAACTAACA 
TAATGTTTGG GAAACAGGGA GCTGGAAA 
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Fig. 1AAAA 

3950 

AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AGCGTGATGC TGACCAGCGA AGAAGAAATT 
AGCGTGATGC TCACCAGCGA AGAAGAAATT 
AGCGTGATGC TCACCAGCGA AGAAGAAATT 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AATGTCATGA TTACAGACGA AGAGGAAATT 
AAGGTCATGA TTACAGACGA AGAGGAAATC 
AATGTAATGA TTACGGATGA AGAAGAGATT 
GATGTCATGC TCACCAGCGA GGAAGAAATC 
CAGGTGCTGA TTACAGATGA GGAAGAAATT 
AATGTGTTAA TGACAAATGA AGAAGAAATT 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
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Fig. 1AAAB 



ATCCCGTGGC TACAGAAGAA 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ATCCCGTGGC TACAGAAGAA 

ATCCCGTGGC TACAGAACAG 

ATCCCGTGGC TACAGAAGAA 

ATCCCGTGGC TACAGAAGAA 

ACCCAGTGGC T AC AG AG C AG 

ACCCAGTGGC TACAGAGCAG 

ACCCAGTGGC TACAGAGCAG 

ACCCCGTGGC CACAGAAGAA 

ACCCCGTGGC CACAGAAGAA 

ACCCCGTGGC CACAGAAGAA 

ACCCCGTGGC CACAGAAGAA 

ACCCAGTGGC CACGGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ATCCTGTGGC TACAGAACAA 

ATCCTGTGGC TACAGAACAA 

ATCCTGTGGC TACAGAACAA 

ATCCTGTGGC TACAGAACAA 

ATCCCGTGGC TACAGAAGAA 

ACCCTGTGGC CACCGAAAGA 

ATCCCGTGGC TACGGAGCAG 

ATCCTGTGGC AACAGAGCAG 

ACCCTGTGGC TACAGAGGAA 

ACCCTGTAGC CACAGAGGAA 

ATCCTGTAGC CACGGAAGAA 

ACCCAGTGGC CACAGAACAG 



4000 

TACGGTGTGG TCTCCAGCAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TATGGTGTGG TGGCAGACAA 
TATGGTGTGG TGGCAGACAA 
TATGGTGTGG TGGCAGACAA 
TAT G GAG C AG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGTGTGG TCTCCAGCAA 
TTTGGGACCG TGGCAGTCAA 
TATGGTTCTG TATCTACCAA 
TATGGAACTG TGGCAAATAA 
TACGGTATCG TGGCAGATAA 
TACGGAGCAG TGGCCATCAA 
TACGGGATAG TCAGCAGCAA 
TACGGCGTGG TGGCCGATAA 
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Fig. 1AAAC 



4050 

GACCCCAGAC ACAGACTGTC AACAGCCAGG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
CTCCCATTGT GGGAACTGTC AACAGCCAGG 
CTCCCATTGT GGGAACTGTC AACAGCCAGG 
CTCCCATTGT GGGAACTGTC AACAGCCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
ACCCTGCGAC CGGAGATGTG CATGCTATGG 
AAGCAGCTAC CGCAGATGTC AACACACAAG 
CTCCCACGAC TGGAACTGTC AATCATCAGG 
CTCCTCAAAT TGGAACTGTC AACAGCCAGG 
AGGCGCAAAC TGGACTTGTG CATAACCAGG 
CAGCCCAGAC ACAAGTTGTC AACAACCAGG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
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Fig. 1AAAD 



CGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGCC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
TGGTATGGTC TGGCAAAACC 
TGGTATGGTC TGGCAAAACC 
TGGTATGGTC TGGCAAAACC 
CGGCATGGTG TGGCAGAATA 
CGGCATGGTG TGGCAGAATA 
CGGCATGGTG TGGCAGAATA 
CGGCATGGTG TGGCAGAATA 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
CGGCATGGTC TGGCAGAACC 
TGGCATGGTG TGGCAAGATA 
AGGCATGGTC TGGCAGGACA 
TGGCATGGTG TGGCAAGATC 
CGGTATGGTC TGGCAGAACC 
TGGTATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 



4100 

GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGC 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GGGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGATGTGTA CCTTCAGGGG 
GTGACGTGTA CCTTCAAGGA 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
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4150 

CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CATCCTTCGC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGAAACTTT CATCCCTCGC 
CTCACACGGA CGGAAACTTT CATCCCTCGC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGACACTTT CATCCTTCTC 
CTCACACGGA CGGCAACTTT CACCCGTCTC 
CTCACACAGA TGGACACTTT CACCCGTCTC 
CACACACGGA CGGACATTTT CACCCCTCTC 
CTCACACGGA TGGACACTTT CATCCTTCTC 
CTCACACGGA CGGCAACTTC CACCCGTCTC 
CTCACACAGA TGGCAACTTT CACCCGTCTC 
CTCACACGGA TGGCAACTTT CACCCGTCTC 
CTCACACGGA CGGAAACTTT CATCCCTCGC 
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A3 7 


CGCTCATGGG 


A3 3 
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CCCTGATGGG 


AAV1 
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AAV 9 


CTCTGATGGG 


AAV7 
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44 2 


CGCTGATGGG 
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CGGATTTGGA CTCAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGATTTGGA CTCAAACACC 
CGGCTTTGGA CTCAAGAACC 
TGGATTCGGA X^TTAAACACC 
AGGCTTTGGA CTGAAACATC 
CGGCTTTGGC CTGAAACATC 
TGGATTTGGA CTGAAACACC 
CGGCTTTGGA CTTAAACATC 
AGGCTTTGGA CTGAAACACC 



4200 

CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTCATC 
CTCCTCCACA GATTCTCATC 
CGCCTCCTCA AATCATGATC 
CTCCGCCTCA GATCCTGATC 
CACCTCCACA GATTCTAATT 
CGCCTCCTCA GATCCTGATC 
CGCCTCCTCA GATCCTGATT 
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4 2_2 AAAAACACCC CGGTACCTGC TAATCCTCCA GAGGTGTTTA CTCCTGCCAA 

4 2_8 AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_15 AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_5b AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_lb AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_13 AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_3a AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_4 AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_5a AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCCAA 

42_10 AAAAACACCC CGGTACCTGC TAATCCTCCA GAGGTGTTTA CTCCTGCCAA 

42_3b AAAAACACCC CGGTACCTGC TAATCCTCCA GAGGTGTTTA CTCCTGCCAA 

42JL1 AAAAACACCC CGGTACCTGC TAATCCTCCA GAGGTGTTTA CTCCTGCCAA 

4 2_6b AAAAACACCC CGGTACCTGC TAATCCTCCA GAGGTGTTTA CTCCTGCCAA 

4 3_1 AAAAACACTC CTGTTCCTGC GGATCCTCCG ACCACCTTCA GCCAGGCCAA 

4 3_5 AAAAACACTC CTGTTCCTGC GGATCCTCCG ACCACCTTCA GCCAGGCCAA 

4 3_12 AAAAACACTC CTGTTCCTGC GGATCCTCCG ACCACCTTCA GCCAGGCCAA 

43_20 AAGAACACAC CGGTTCCAGC GGACCCGCCG CTTACCTTCA ACCAGGCCAA 

4 3_21 AAGAACACAC CGGTTCCAGC GGACCCGCCG CTTACCTTCA ACCAGGCCAA 

4 3_23 AAGAACACAC CGGTTCCAGC GGACCCGCCG 'CTTACCTTCA ACCAGGCCAA 

4 3_25 AAGAACACAC CGGTTCCAGC GGACCCGCCG CTTACCTTCA ACCAGGCCAA 

44_1 AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCTAA 

4 4_5 AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCTAA 

223_10 AAAAACACAC CGGTACCTGC TAATCCTCCA GAAGTGTTTA CTCCTGCCAA 

223_2 AAAAACACGC CGGTACCTGC TAATCCTCCA GAAGTGTTTA CTCCTGCCAA 

223_4 AAAAACACAC CGGTACCTGC TAATCCTCCA GAAGTGTTTA CTCCTGCCAA 

223_5 AAAAACACAC CGGTACCTGC TAATCCTCCA GAAGTGTTTA CTCCTGCCAA 

223_6 AAAAACACAC CGGTACCTGC TAATCCTCCA GAAGTGTTTA CTCCTGCCAA 

223_7 AAAAACACAC CGGTACCTGC TAATCCTCCA GAAGTGTTTA CTCCTGCCAA 

A3_4 AAAAACACAC CTGTGCCAGC GAATCCCGCG ACCACTTTCA CTCCTGGAAA 

A3_5 AAAAACACAC CTGTGCCAGC GAATCCCGCG ACCACTTTCA CTCCTGGAAA 

A3_7 AAAAACACAC CTGTGCCAGC GAATCCCGCG ACCACTTTCA CTCCTGGAAA 

A3_3 AAAAACACAC CTGTGCCAGC GAATCCCGCG ACCACTTTCA CTCCTGGAAA 

42_12 A... A 

AAV1 AAAAACACGC CTGTTCCTGC GAATCCTCCG GCGGAGTTTT CAGCTACAAA 

AAV2 AAGAACACCC CGGTACCTGC GAATCCTTCG ACCACCTTCA GTGCGGCAAA 

AAV 3 AAAAATACTC CGGTACCGGC AAATCCTCCG ACGACTTTCA GCCCGGCCAA 

AAV 8 AAGAACACGC CTGTACCTGC GGATCCTCCG ACCACCTTCA ACCAGTCAAA 

AAV 9 AAAAATACAC CAGTGCCGGC AGATCCTCCT CTTACCTTCA ATCAAGCCAA 

AAV7 AAGAACACTC CCGTTCCCGC TAATCCTCCG GAGGTGTTTA CTCCTGCCAA 

44_2 AAGAATACAC CTGTTCCCGC GGATCCTCCA ACTACCTTCA GTCAAGCTAA 
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GGT 


CAGCGTGGAA 
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CGGACA. 


GGT 


CAGCGTGGAA 


A3 3 
4 2_12 


GTTTGCTTCG 


TTCATTACCC 


AGTATTCCAC 


CGGACA. 


GGT 


CAGCGTGGAA 
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TTCATCACAC 


AGTACTCCAC 


GGGACACGGT 


CAGCGTGGAG 
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GTTTGCTTCA 


TTTATCACTC 


AGTACTCCAC 


TGGACA. 


GGT 


CAGCGTGGAA 


AAV8 


GCTGAACTCT 


TTCATCACGC 


AATACAGCAC 


CGGACA. 


GGT 


CAGCGTGGAA 


AAV 9 


GCTGAACTCT 


TTCATCACGC 


AGTACAGCAC 


GGGACA. 


AGT 


CAGCGTGGAA 


AAV7 


GTTTGCTTCG 


TTCATCACAC 


AGTACAGCAC 


CGGACA. 


AGT 


CAGCGTGGAA 


44 2 


GCTGGCGTCG 


TTCATCACGC 


AGTACAGCAC 


CGGACA . 


GGT 


CAGCGTGGAA 
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42_2 ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

42_8 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2_15 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_5b ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2_lb ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_13 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_3a ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_4 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_5a ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_10 ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 2_3b ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

42_11 ATCGAGTGGG AACTGCAGAA AGAGAACAGC AAACGCTGGA ATCCAGAGAT 

42_6b ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

43_1 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT 

43_5 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT' 

43_12 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT 

43_20 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

43_21 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

43_23 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 3_25 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

44_1 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAGAT 

4 4_5 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAGAT 

223_10 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_2 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_4 ATCGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_5 ATCGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_6 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_7 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

A3__4 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

A3_5 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCGGAAAT 

A3_7 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

A3_3 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

42_12 

AAV1 ATTGAATGGG AGCTGCAGAA AGAAAACAGC 

AAV 2 ATCGAGTGGG AGCTGCAGAA GGAAAACAGC 

AAV 3 ATTGAGTGGG AGCTACAGAA AGAAAACAGC 

AAV 8 ATTGAATGGG AGCTGCAGAA GGAAAACAGC 

AAV 9 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC 

AAV 7 ATCGAGTGGG AGCTGCAGAA GGAAAACAGC 

44 2 ATTGAATGGG AGCTGCAGAA AGAAAACAGC 



AAGCGCTGGA ATCCCGAAGT 
AAACGCTGGA ATCCCGAAAT 
AAACGTTGGA ATCCAGAGAT 
AAGCGCTGGA ACCCCGAGAT 
AAGCGCTGGA ATCCAGAGAT 
AAGCGCTGGA ACCCGGAGAT 
AAACGCTGGA ACCCAGAGAT 
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4400 

CCAAGTCTAA TAAT . GTGGA ATTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT . GTGGA . CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT . GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
CCAAGTCTAA TAAT . GTGGA ATTTGCTGTC 
CCAAGTCTAA TAAT. GTGGA ATTTGCTGTC 
CCAAGTCTAA TAAT . GTGGA ATTTGCTGTC 
CCAAGTCTAA TAAT. GTGGA ATTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
ACAAATCTAC AAAT. GTGGA CTTCGCTGTT 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTT 
ACAAACAGAC TGGA. GTGGA CTTTGCTGTT 
ACAAACAGAC TGGA. GTGGA CTTTGCTGTT 
ACAAACAGAC TGGA. GTGGA CTTTGCTGTT 
ACAAACAGAC TGGA. GTGGA CTTTGCTGTT 
ACAAACAGAC TGGA. GTGGA CTTTGCTGTT 
ACAAACAGAC TGGA. GTGGA CTTTGCTGTT 
ACAAGTCGGT GAAT. GTGGA GTTTACCGTG 
ACAAGTCGGT GAAT. GTGGA GTTTACCGTG 
ACAAGTCGGT GAAT. GTGGA GTTTACCGTG 
ACAAGTCGGT GAAT. GTGGA GTTTACCGTG 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
CAAAATCTGC CAAC . GTTGA TTTTACTGTG 
ACAAGTCTGT TAATCGTGGA CTT.ACCGTG 
ACAAGTCTGT TAAT. GTGGA CTTTACTGTA 
ACAAATCTAC AAGT. GTGGA CTTTGCTGTT 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTC 
AAAAGCAGAC TGGT. GTGGA CTTTGCCGTT 
ACAAATCTAC AAAT. GTGGA CTTTGCTGTT 
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AACAACGAAG 


GGGTTTATAC 


42 6b 


AACAACGAAG 


GGGTTTATAC 


43 1 


AATACTGAGG 


GTACTTATTC 


43 5 


AATACCGAGG 


GTACTTATTC 
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AATACTGAGG 


GTACTTATTC 


43 20 


AACACGGAAG 


GAGTTTATAG 
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AACACGGAAG 


GAGTTTATAG 
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AACACGGAAG 


GAGTTTATAG 


43 25 


AACACGGAGG 


GGGTTTATAG 


44 1 


AACACAGATG 


GCACTTATTC 


44 5 


AACACAGATG 


GCACTTATTC 


223 10 


GACAGCCAGG 


GTGTTTACTC 


223 2 


GACAGCCAGG 


GTGTTTACTC 


223 4 


GACAGCCAGG 


GTGTTTACTC 


223 5 


GACAGCCAGG 


GTGTTTACTC 
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GACAGCCAGG 


GTGTTTACTC 
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GACAGCCAGG 


GTGTTTACTC 
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GACGCAAACG 


GTGTTTATTC 


A3 5 


GACGCAAACG 


GTGTTTATTC 


A3 7 


GACGCAAACG 


GTGTTTATTC 


A3 3 


GACGCAAACG 


GTGTTTATTC 


42JL2 


AATACTGAGG 


GTACTTATTC 


AAV1 


GACAACAATG 


GACTTTATAC 


AAV 2 


GATACTAATG 


GCGTGTATTC 


AAV 3 


GACACTAATG 


GTGTTTATAG 


AAV8 


AATACAGAAG 


GCGTGTACTC 


AAV9 


AATACCGAAG 


GTGTTTACTC 


AAV 7 


GACAGCCAGG 


GTGTTTACTC 


44 2 


AACACAGATG 


GCACTTATTC 
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TGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CTCGTTATCT 
AGAGCCTCGC CCCATTGGCA CTCGTTATCT 
AGAGCCTCGC CCCATTGGCA CTCGTTATCT 
CGAGCCTCGC CCCATTGGCA CCCGTTACCT 
CGAGCCTCGC CCCATTGGCA CCCGTTACCT 
CGAGCCTCGC CCCATTGGCA CCCGTTACCT 
CGAGCCTCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

TGAGCCT 

TGAGCCT ]][ 

TGAGCCT 

TGAGCCT ] 

TGAGCCT 

TGAGCCT 

TGAACCCCGC CCTATTGGCA CTCGTTACCT 
TGAACCCCGC CCTATTGGCA CTCGTTACCT 
TGAACCCCGC CCTATTGGCA CTCGTTACCT 
TGAACCCCGC CCTATTGGCA CTCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGCA CCCGTTACCT 
AGAGCCTCGC CCCATTGGCA CCAGATACCT 
TGAACCTCGC CCTATTGGAA CCCGGTATCT 
TGAACCCCGC CCCATTGGCA CCCGTTACCT 
TGAGCCTCGC CCCATTGGTA CTCGTTACCT 
TGAGCCTCGC CCTATTGGCA CTCGTTACCT 
TGAGCCTCGC CCCATCGGCA CCCGTTACCT 
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CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGC TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

CACCCGTAGC CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAC CTGTAATTAC TTGTTAATCA ATAAACCGGT TGATTCGTTT 

CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT 

CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

CACCCGTAAT CTGTAATTGC TCGTTAATCA ATAAACCGGT TGATTCGTTT 

CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TGATTCGTTT 



TACCCGGAAC TTGTAATTTC CTGTTAATGA ATAAACCGAT TTATGCGTTT 
TACCCGGAAC TTGTAATTTC CTGTTAATGA ATAAACCGAT TTATGCGTTT 
TACCCGGAAC TTGTAATTTC CTGTTAATGA ATAAACCGAT TTATGCGTTT 
TACCCGGAAC TTGTAATTTC CTGTTAATGA ATAAGCCGAT TTATGCGTTT 
CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 
TACCCGTCCC CTGTAATTAC GTGTTAATCA ATAAACCGGT TGATTCGTTT 
GACTCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGTT TAATTCGTTT 
CACACGAAAC TTGTGAATCC TGGTTAATCA ATAAACCGTT TAATTCGTTT 
CACCCGTAAT CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 
CACCCGTAAT TTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 
CACCCGTAAT CTGTAATTGC ATGTTAATCA ATAAACCGGT TGATTCGTTT 
CACCCGTAAT CTGTAATTGC TTGTTAAT CA ATAAA CCGGT TGATTCGTTT 



vpl-3 stop 



PolyA signal 
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CAGTTGAACT 


TTGGTCTC . 
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GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC. 
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GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 
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GCGAAGGGCG 


AATTP 




CAGTTGAACT 


TTGGTCTC. 
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GCGAAGGGCG 


AATTCGTTTA 


AACCTGCAGG 


CAGTTGAACT 


TTGGTCTC . 


. 


. . . AAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTP 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC. 


T 


GCGAAGGGCG 


AATTC ..... 




CAGTTGAACT 


TTGGTC 




. . .AAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC. 
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GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC. 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTCGTTTA 


AACCTGCAGG 


CAGTTGAACT 


TTGGTCTC. 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC. 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC. 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC. 


T 


GCGAAGGGCG 


AATTC 




CAGTTGAACT 


TTGGTCTC . 


T 


GCGAAGGGCG 


AATTC 





CAGTTGAACT TTGGTCTC. T GCGAAGGGCG AATTCGC.GG CCGCTA.... 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTCGT.TT AAACCT 

CAGTTGAACT TTGGTCTC. T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTCCT GTCCTTCTTA TCTTATCGGT TACCATGGTT 
CAGTTGAACT TTGGTCTC . T GCGTATTTCT . . TTCTT . AT CTAGTTTCCA 
CAGTTGAACT TTGGCTCT . T GTGCACTTCT TTATCTTTAT CTTGTTTCCA 

CAGTTGAACT TTGGTCTC. T GCG 

CAGTTGAACT TTGGTCTC . T GCG . . . . . 

CAGTTGAACT TTGGTCTCCT GTGCTTCTTA TCTTATCGGT TTCCATAGCA 
CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 
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AAV3 TGGCTACTGC GTAGATAAGC AGCGGCCTGC GGCGCTTGCG CTTCGCGGTT 

AAV8 

AAV9 

AAV7 ACTGGTTACA CATTAACTGC TTGGGTGCGC TTCACGATAA GAACACTGAC 

44 2 
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4650 



CTTGGC GTAATCATGG GTCATAG. 



TCGCGA TAAAAGACTT ACGTCATCGG GTTACCCCTA GTGATGGAGT 

ATGGCG GGTTAATCAT TAACTACAAG GA.ACCCCTA GTGATGGAGT 

TACAACTGCT GGTTAATATT TAACTCTCGC CATACCTCTA GTGATGGAGT 



.GTCACCGC GGTACCCCTA GTGATGGAGT 
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Fig. 2B 
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Fig. 2C 
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Fig. 2D 
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Fig. 2F 
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102/105 



WO 03/042397 PCT/US02/33629 

Fig. 3A 



Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 



Glu His Leu Pro Gly lie Ser Asp Ser Phe Val Asn Trp ,Val Ala Glu 
20 25 30 



Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 
35 40 45 



Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 



Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 



Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His Val Leu Val Glu 
8 5 90 95 



Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin lie 
100 105 110 



Arg Glu Lys Leu Val Gin Thr He Tyr Arg Gly Val Glu Pro Thr Leu 
115 120 125 



Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 ~ 135 140 



Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 



Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr He 
165 170 175 



Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 



Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 



Pro Asn Ser Asp Ala Pro Val He Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 
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Fig. 3B 



Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 

225 230 235 240 

Gin Trp He Gin Glu Asp Gin Ala Ser Tyr He Ser Phe Asn Ala Ala 

245 250 255 



Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 



He Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ser 
275 280 285 



Leu Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 



Asn Gly Tyr Asp Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
305 310 315 320 



Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
325 330 335 



Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala Val Pro 
340 345 350 



Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 



Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 375 380 



Lys Val Val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 



Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
405 410 415 



He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 
420 425 430 



Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 
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Fig. 3C 



Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
450 455 460 



Glu Val Lys Glu Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu Val 
465 470 • 475 480 



Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Ser Lys Arg Pro Ala 
485 490 495 



Pro Asp Asp Ala Asp He Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 
500 505 ' 510 



Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 
515 520 525 



Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met He Gin Met 
530 535 540 



Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn He 
545 550 555 560 



Cys Phe Thr His Gly Val Arg Asp Cys Leu Glu Cys Phe Pro Gly Val 
565 570 575 



Ser Glu Ser Gin Pro Val Val Arg Lys Lys Thr Tyr Arg Lys Leu Cys 
580 585 590 - 



Ala He His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys Ser Ala 
595 600 605 



Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 
610 615 ^ 620 
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SEQUENCE LISTING 

<110> The Trustees of The University of Pennsylvania 
Gao, Guangping 
Wilson, James M. 
Alvira, Mauricio 

<120> A Method of Detecting and/or Identifying Adeno-Associated 

Virus (AAV) Sequences and Isolating Novel Sequences Identified 
Thereby 

<13 0> UPN-02735PCT 

<150> US 60/350,607 
<151> 2001-11-13 

<150> US 60/341,117 
<151> 2001-12-17 

<150> US 60/377,066 
<151> 2002-05-01 

<150> US 60/386, 675 
<151> 2002-06-05 

<160> 120 

<170> Patentln version 3.1 

<210> 1 

<211> 4721 

<212> DNA 

<213> adeno-associated virus serotype 7 

<400> 1 



ttggccactc 


cctctatgcg 


cgctcgctcg 


ctcggtgggg 


cctgcggacc 


aaaggtccgc 


60 


agacggcaga 


gctctgctct 


gccggcccca 


ccgagcgagc 


gagcgcgcat 


agagggagtg 


120 


gccaactcca 


tcactagggg 


taccgcgaag 


cgcctcccac 


gctgccgcgt 


cagcgctgac 


180 


gtaaatcacg 


tcatagggga 


gtggtcctgt 


attagctgtc 


acgtgagtgc 


ttttgcgaca 


240 


ttttgcgaca 


ccacgtggcc 


atttgaggta 


tatatggccg 


agtgagcgag 


caggatctcc 


300 


attttgaccg 


cgaaatttga 


acgagcagca 


gccatgccgg 


gtttctacga 


gatcgtgatc 


360 


aaggtgccga 


gcgacctgga 


cgagcacctg 


ccgggcattt 


ctgactcgtt 


tgtgaactgg 


420 


gtggccgaga 


aggaatggga 


gctgcccccg 


gattctgaca 


tggatctgaa 


tctgatcgag 


480 


caggcacccc 


tgaccgtggc 


cgagaagctg 


cagcgcgact 


tcctggtcca 


atggcgccgc 


54 0 


gtgagtaagg 


ccccggaggc 


cctgttcttt 


gttcagttcg 


agaagggcga 


gagctacttc 


600 


caccttcacg 


ttctggtgga 


gaccacgggg 


gtcaagtcca 


tggtgctagg 


ccgcttcctg 


660 


agtcagattc 


gggagaagct 


ggtccagacc 


atctaccgcg 


gggtcgagcc 


cacgctgccc 


720 


aactggttcg 


cggtgaccaa 


gacgcgtaat 


ggcgccggcg 


gggggaacaa 


ggtggtggac 


780 


gagtgctaca 


tccccaacta 


cctcctgccc 


aagacccagc 


ccgagctgca 


gtgggcgtgg 


840 


actaacatgg 


aggagtatat 


aagcgcgtgt 


ttgaacctgg 


ccgaacgcaa 


acggctcgtg 


900 
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gcgcagcacc 


tgacccacgt 


cagccagacg 


caggagcaga 


acaaggagaa 


tctgaacccc 


960 


aattctgacg 


cgcccgtgat 


caggtcaaaa 


acctccgcgc 


gctacatgga 


gc tggtcggg 


1020 


tggctggtgg 


accggggcat 


cacctccgag 


aagcagtgga 


tccaggagga 


ccaggcctcg 


1080 


tacatctcct 


tcaacgccgc 


ctccaactcg 


cggtcccaga 


tcaaggccgc 


gctggacaat 


1140 


gccggcaaga 


tcatggcgct 


gaccaaatcc 


gcgcccgact 


acctggtggg 


gccctcgctg 


1200 


cccgcggaca 


ttaaaaccaa 


ccgcatctac 


cgcatcctgg 


agctgaacgg 


gtacgatcct 


1260 


gcctacgccg 


gctccgtctt 


tctcggctgg 


gcccagaaaa 


agttcgggaa 


gcgcaacacc 


1320 


atctggctgt 


ttgggcccgc 


caccaccggc 


aagaccaaca 


t tgcggaagc 


catcgcccac 


1380 


gccgtgccct 


tctacggctg 


cgtcaactgg 


accaatgaga 


actttccctt 


caacgattgc 


1440 


gtcgacaaga 


tggtgatctg 


gtgggaggag 


ggcaagatga 


cggccaaggt 


cgtggagtcc 


1500 


gccaaggcca 


ttctcggcgg 


cagcaaggtg 


cgcgtggacc 


aaaagtgcaa 


gtcgtccgcc 


1560 


cagatcgacc 


ccacccccgt 


gatcgtcacc 


tccaacacca 


acatgtgcgc 


cgtgattgac 


1620 


gggaacagca 


ccaccttcga 


gcaccagcag 


ccgttgcagg 


accggatgtt 


caaatttgaa 


1680 


ctcacccgcc 


gtctggagca 


cgactttggc 


aaggtgacga 


agcaggaagt 


caaagagttc 


17 4 0 


ttccgctggg 


ccagtgatca 


cgtgaccgag 


gtggcgcatg 


agttctacgt 


cagaaagggc 


1800 


ggagccagca 


aaagacccgc 


ccccgatgac 


gcggatataa 


gcgagcccaa 


gcgggcctgc 


1860 


ccctcagtcg 


cggatccatc 


gacgtcagac 


gcggaaggag 


ctccggtgga 


ctttgccgac 


1920 


aggtaccaaa 


acaaatgttc 


tcgtcacgcg 


ggcatgattc 


agatgctgtt 


tccctgcaaa 


1980 


acgtgcgaga 


gaatgaatca 


gaatttcaac 


atttgcttca 


cacacggggt 


cagagactgt 


2 04 0 


ttagagtgtt 


tccccggcgt 


gtcagaatct 


caaccggtcg 


tcagaaaaaa 


gacgtatcgg 


2100 


aaactctgcg 


cgattcatca 


tctgctgggg 


cgggcgcccg 


agattgcttg 


ctcggcctgc 


2160 


gacctggtca 


acgtggacct 


ggacgactgc 


gtttctgagc 


aataaatgac 


ttaaaccagg 


2220 


tatggctgcc 


gatggttatc 


ttccagattg 


gctcgaggac 


aacctctctg 


agggcattcg 


2280 


cgagtggtgg 


gacctgaaac 


ctggagcccc 


gaaacccaaa 


gccaaccagc 


aaaagcagga 


2340 


caacggccgg 


ggtctggtgc 


ttcctggcta 


caagtacctc 


ggacccttca 


acggactcga 


2400 


caagggggag 


cccgtcaacg 


cggcggacgc 


agcggccctc 


gagcacgaca 


aggcctacga 


2460 


ccagcagctc 


aaagcgggtg 


acaatccgta 


cctgcggtat 


aaccacgccg 


acgccgagtt 


2520 


tcaqqaqcqt 


ctgcaagaag 


atacgtcatt 


tgggggcaac 


ctcgggcgag 


cagtcttcca 


2580 


ggccaagaag 


cgggttctcg 


aacctctcgg 


tctggttgag 


gaaggcgcta 


agacggctcc 


2640 


tgcaaagaag 


agaccggtag 


agccgtcacc 


tcagcgttcc 


cccgactcct 


ccacgggcat 


2700 


cggcaagaaa 


ggccagcagc 


ccgccagaaa 


gagactcaat 


ttcggtcaga 


ctggcgactc 


2760 


agagtcagtc 


cccgaccctc 


aacctctcgg 


agaacctcca 


gcagcgccct 


ctagtgtggg 


2820 
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atctggtaca 


gtggctgcag 


gcggtggcgc 


accaatggca 


gacaataacg 


aaggtgccga 


2880 


cggagtgggt 


aatgcctcag 


gaaattggca 


ttgcgattcc 


acatggctgg 


gcgacagagt 


2940 


cattaccacc 


agcacccgaa 


cctgggccct 


gcccacctac 


aacaaccacc 


tctacaagca 


3000 


aatctccagt 


gaaactgcag 


gtagtaccaa 


cgacaacacc 


tacttcggct 


acagcacccc 


3060 


ctgggggtat 


tttgacttta 


acagattcca 


ctgccacttc 


tcaccacgtg 


actggcagcg 


3120 


actcatcaac 


aacaactggg 


gattccggcc 


caagaagctg 


cggttcaagc 


tcttcaacat 


3180 


ccaggtcaag 


gaggtcacga 


cgaatgacgg 


cgttacgacc 


atcgctaata 


accttaccag 


3240 


cacgattcag 


gtattctcgg 


actcggaata 


ccagctgccg 


tacgtcctcg 


gctctgegca 


3300 


ccagggctgc 


ctgcctccgt 


tcccggcgga 


cgtcttcatg 


attcctcagt 


acggctacct 


3360 


gactctcaac 


aatggcagtc 


agtctgtggg 


acgttcctcc 


ttctactgcc 


tggagtactt 


3420 


cccctctcag 


atgctgagaa 


cgggcaacaa 


ctttgagttc 


agctacagct 


tcgaggacgt 


3480 


gcctttccac 


agcagctacg 


cacacagcca 


gagcctggac 


cggctgatga 


atcccctcat 


3540 


cgaccagtac 


ttgtactacc 


tggccagaac 


acagagtaac 


ccaggaggca 


cagctggcaa 


3600 


tcgggaactg 


cagttttacc 


agggcgggcc 


ttcaactatg 


gccgaacaag 


ccaagaattg 


3660 


gttacctgga 


ccttgcttcc 


ggcaacaaag 


agtctccaaa 


acgctggatc 


aaaacaacaa 


3720 


cagcaacttt 


gcttggactg 


gtgccaccaa 


atatcacctg 


aacggcagaa 


actcgttggt 


3780 


taatcccggc 


gtcgccatgg 


caactcacaa 


ggacgacgag 


gaccgctttt 


tcccatccag 


3840 


cggagtcctg 


atttttggaa 


aaactggagc 


aactaacaaa 


actacattgg 


aaaatgtgtt 


3900 


aatgacaaat 


gaagaagaaa 


ttcgtcctac 


taatcctgta 


gccacggaag 


aatacgggat 


3960 


agtcagcagc 


aacttacaag 


cggctaatac 


tgcagcccag 


acacaagttg 


tcaacaacca 


4020 


gggagcctta 


cctggcatgg 


tctggcagaa 


ccgggacgtg 


tacctgcagg 


gtcccatctg 


4080 


ggccaagatt 


cctcacacgg 


atggcaactt 


tcacccgtct 


cctttgatgg 


gcggctttgg 


4140 


acttaaacat 


ccgcctcctc 


agatcctgat 


caagaacact 


cccgttcccg 


ctaatcctcc 


4200 


ggaggtgttt 


actcctgcca 


agtttgcttc 


gttcatcaca 


cagtacagca 


ccggacaagt 


4260 


cagcgtggaa 


atcgagtggg 


agctgcagaa 


ggaaaacagc 


aagcgctgga 


acccggagat 


4320 


tcagtacacc 


tccaactttg 


aaaagcagac 


tggtgtggac 


tttgccgttg 


acagccaggg 


4380 


tgtttactct 


gagcctcgcc 


ctattggcac 


tcgttacctc 


acccgtaatc 


tgtaattgca 


4440 


tgttaatcaa 


taaaccggtt 


gattcgtttc 


agttgaactt 


tggtctcctg 


tgcttcttat 


4500 


cttatcggtt 


tccatagcaa 


ctggttacac 


attaactgct 


tgggtgcgct 


tcacgataag 


4560 


aacactgacg 


tcaccgcggt 


acccctagtg 


atggagttgg 


ccactccctc 


tatgcgcgct 


4620 


cgctcgctcg 


gtggggcctg 


cggaccaaag 


gtccgcagac 


ggcagagctc 


tgctctgccg 


4 680 


gccccaccga 


gcgagcgagc 


gcgcatagag 


ggagtggcca 


a 
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<210> 2 
<211> "737 

cipsid protein of adeno-associated virus serotpye 7 

M 4 e°t°ila 2 Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 ^ 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 



20 



Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 



65 



70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 1° 5 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 
130 135 I 40 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 lb ° 

Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 1'° 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 
180 185 

Pro Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly 
195 200 205 

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
210 215 220 

Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 



4 



WO 03/042397 



PCT/US02/33629 



He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 

Leu Tyr Lys Gin He Ser Ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 
260 265 270 

Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 

Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn 
290 295 300 

Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie 
305 310 315 1W 

Gin Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn 
325 330 335 

Asn Leu Thr Ser Thr He Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu 
340 345 350 

Pro Tvr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 ' 360 365 

Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 

Gly Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 

Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser 
405 410 4J -° 

Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 

Asp Arg Leu Met Asn Pro Leu. He Asp Gin Tyr Leu Tyr Tyr Leu Ala 
435 440 445 

Arg Thr Gin Ser Asn Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin 
450 455 460 

Phe Tyr Gin Gly Gly Pro Ser Thr Met Ala Glu Gin Ala Lys Asn Trp 
465 470 475 *™ 

Leu Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp 
465 490 4 " 
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Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 

Leu Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie 
530 ^ 535 540 

Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn Val Leu 
545 550 555 560 

Met Thr Asn Glu Glu Glu He Arg Pro Thr Asn Pro Val Ala Thr Glu 
565 570 575 



Glu Tyr Gly He Val Ser Ser Asn Leu Gin Ala Ala Asn Thr Ala Ala 
580 585 590 



Gin Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp 
595 600 605 



Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro 
610 615 620 



His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625 630 635 640 



Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro 
645 650 655 



Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He 
660 665 670 



Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu 
675 680 685 

Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser 
690 695 700 

Asn Phe Glu Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly 
705 710 715 720 



Val Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn 
725 730 735 



Leu 
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<210> 3 

<211> 623 

<212> PRT 

<213> rep protein of adeno-associated virus serotype 7 

<400> 3 

Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 
1 1 5 10 15 

Glu His Leu Pro Gly lie Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 



Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu lie 
35 40 45 



Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 

Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 * 80 



Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His Val Leu Val Glu 
85 90 95 



Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin He 
100 105 HO 



Arg Glu Lys Leu Val Gin Thr He Tyr Arg Gly Val Glu Pro Thr Leu 
115 120 125 



Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 
130 135 140 



Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu Pro Lys 

145 150 155 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr He 
165 170 175 



Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 
180 185 190 



Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 
195 200 205 



Pro Asn Ser Asp Ala Pro Val He Arg Ser Lys Thr Ser Ala Arg Tyr 
210 215 220 

Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly He Thr Ser Glu Lys 
225 230 235 240 
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Gin Trp He Gin Glu Asp Gin Ala Ser Tyr He Ser Phe Asn Ala Ala 
245 250 255 



Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Gly Lys 
260 265 270 



He Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly Pro Ser 
275 280 285 



Leu Pro Ala Asp He Lys Thr Asn Arg He Tyr Arg He Leu Glu Leu 
290 295 300 



Asn Gly Tyr Asp Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly Trp Ala 
305 ~ 310 315 320 



Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 
325 330 335 



Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala Val Pro 
340 345 350 



Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 
355 360 365 



Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
370 ^ 375 380 



Lys Val Val Glu Ser Ala Lys Ala He Leu Gly Gly Ser Lys Val Arg 
385 390 395 400 



Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr Pro Val 
405 410 415 



He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly Asn Ser 
420 425 430 



Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 
435 440 445 



Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 
450 " ^ 455 460 



Glu Val Lys Glu Phe Phe Arg Trp Ala Ser Asp His Val Thr Glu Val 
465 470 475 480 



Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Ser Lys Arg Pro Ala 
485 490 495 
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Pro Asp Asp Ala Asp He Ser Glu Pro Lys Arg Ala Cys Pro Ser Val 
500 505 510 

Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp Phe Ala 
515 520 525 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met He Gin Met 
530 535 540 

Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn He 
545 550 555 560 

Cys Phe Thr His Gly Val Arg Asp Cys Leu Glu Cys Phe Pro Gly Val 
565 570 575 

Ser Glu Ser Gin Pro Val Val Arg Lys Lys Thr Tyr Arg Lys Leu Cys 
580 585 590 

Ala He His His Leu Leu Gly Arg Ala Pro Glu He Ala Cys Ser Ala 
595 600 605 

Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu Gin 



<210> 4 
<211> 4393 
<212> DNA 

<213> adeno-associated virus serotype 8 
<400> 4 

cagagaggga gtggccaact ccatcactag gggtagcgcg aagcgcctcc cacgctgccg 60 

cgtcagcgct gacgtaaatt acgtcatagg ggagtggtcc tgtattagct gtcacgtgag 120 

tgcttttgcg gcattttgcg acaccacgtg gccatttgag gtatatatgg ccgagtgagc 180 

gagcaggatc tccattttga ccgcgaaatt tgaacgagca gcagccatgc cgggcttcta ^240 

cgagatcgtg atcaaggtgc cgagcgacct ggacgagcac ctgccgggca tttctgactc 300 

gtttgtgaac tgggtggccg agaaggaatg ggagctgccc ccggattctg acatggatcg 360 

gaatctgatc gagcaggcac ccctgaccgt ggccgagaag ctgcagcgcg acttcctggt 420 

ccaatggcgc cgcgtgagta aggccccgga ggccctcttc tttgttcagt tcgagaaggg 48 0 

cgagagctac tttcacctgc acgttctggt cgagaccacg ggggtcaagt ccatggtgct 54 0 

aggccgcttc ctgagtcaga ttcgggaaaa gcttggtcca gaccatctac ccgcggggtc 600 

gagccccacc ttgcccaact ggttcgcggt gaccaaagac gcggtaatgg cgccggcggg 660 

ggggaacaag gtggtggacg agtgctacat ccccaactac ctcctgccca agactcagcc 720 

cgagctgcag tgggcgtgga ctaacatgga ggagtatata agcgcgtgct tgaacctggc 780 
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cgagcgcaaa 


cggctcgtgg cgcagcacct gacccacgtc 


agccagacgc 


aggagcagaa 


840 


caaggagaat 


ctoaarccca attct traccrc 


crccccrt crate 

y w w y w y w w 


aggtcaaaaa 


cctccgcgcg 


900 


ctatatggag 


et crrr terra crt cf crctcrcftaaa 

^ y y y y y y wy y ^- y y 


ccggggcatc 


acctccgaga 


agcagtggat 


960 


ccaggaggac 


carrrrcc tccrt acatctcctt 


v.-* uuw y w w y w o 


tccaactcgc 


ggtcccagat 


1020 


caaggccgcg 


ct"crcrar , aa't~ci ccooci a a oat 


catggcgctg 


accaaatccg 


cgcccgacta 


1080 


cctggtgggg 


ccctcgctgc ccgcggacat 


tacccagaac 


cgcatctacc 


gcatcctcgc 


1140 


tctcaacggc 


tacgaccctg cctacgccgg 


ctccgtcttt 


ctcggctggg 


ctcagaaaaa 


1200 


gttcgggaaa 


cgcaacacca tctggctgtt 


tggacccgcc 


accaccggca 


agaccaacat 


1260 


tgcggaagcc 


atcgcccacg ccgtgccctt 


ctacggctgc 


gtcaactgga 


ccaatgagaa 


1320 


ctttcccttc 


aatgattgcg tcgacaagat 


aataatctaa 

yy LyuuuLyy 


tgggaggagg 


gcaagatgac 


1380 


ggccaaggtc 


gtggagtccg ccaaggccat 


tctcacr ccrac 

i-wyywyyw 


agcaaggtgc 


gcgtggacca 


1440 


aaagtgcaag 


tcgtccgccc agatcgaccc 


cacccccgtg 


atcgtcacct 


ccaacaccaa 


1500 


catgtgcgcc 


gtgattgacg ggaacagcac 


caccttcgag 


caccagcagc 


ctctccagga 


1560 


ccggatgttt 


aacrt tccraac tcaccccrcccr 

U- d U u v y U v4 — v* w w w y V— ' * — y 


tctcrcracrcac 


gactttggca 


aggtgacaaa 


1620 


gcaggaagtc 


aaacracrttct teerretrr aac 


caotcratcac 

w cl y w y u w w ci v^»» 


gtgaccgagg 


tggcgcatga 


1680 


gttttacgtc 




aa era ccccrcc 
y y 


cccgatgacg 


eggataaaag 


1740 


cgagcccaag 


r* rrrrrr r* r* 1" rrr* r* r*r*t r*a rrtfrrr' 


rr rr a t~ c c a t c rr 

yyuLuuciLuy 


aegtcagacg 


eggaaggage 


1800 


tccggtggac 


tttgccgaca ggtaccaaaa 


caaatgtt ct 


cgtcacgcgg 


geatgettea 


1860 


gatgctgttt 


ccctgcaaaa cgtgcgagag 


aatgaatcag 


aatttcaaca 


tttgettcac 


1920 


acacggggtc 


acr acractcrct cacracitcfttt 


cccccrcrccrtcr 

w w w- y y w y y 


tcagaatctc 


aaceggtegt 


1980 


cagaaagagg 


acgtatcgga aactctgtgc 


gattcatcat 


ctgctggggc 


gggctcccga 


2040 


gattgcttgc 




ccf to cracc tcr 

wy uy y wa w w w y 


gatgactgtg 


tttctgagca 


2100 


ataaatgact 


fsaarranrrt" a trrrrr , 't"rrr*r , rr 


at rrrrtta t ct 


tccagattgg 


ctcgaggaca 


2160 


acctctctga 


«~r rt rr r- a 1~ +■ r* rr r* rr 3 rr 1~ rr rr t" rr rr rr 
yyy uai- Luy u y«y ^yy L yyy 


Ly u uy ocicii<u 


tggagccccg 


aagcccaaag 


2220 


ccaaccagca 


aaanracrrrac r/acacrcccrcf rr 


crtctaatact 

y w w i« y y w y w w 


tcctggctac 


aagtacctcg 


2280 


gacccttcaa 


ccrcract ccrac aaaaaacraac 

*w y y *— i ' y u. >w la. y y y y y y w 


ccgtcaacgc 


ggeggacgea 


gcggccctcg 


2340 


agcacgacaa 


rrrrcct accf ac cac/cacfc toe 

y y ^ w4 » y w ^ y y y 


aaacacfcft era 
«y y^yyy ^-y«. 


caatccgtac 


ctgeggtata 


2400 


accacgccga 


cgccgagttt caggagcgtc 


tgcaagaaga 


tacgtctttt 


gggggcaacc 


2460 


tcgggcgagc 


agtcttccag gccaagaagc 


gggttctcga 


acctctcggt 


ctggttgagg 


2520 


aaggcgctaa 


gacggctcct ggaaagaaga 


gaeeggtaga 


gccatcaccc 


cagcgttctc 


2580 


cagactcctc 


tacgggcatc ggcaagaaag 


gccaacagcc 


cgccagaaaa 


agactcaatt 


2640 


ttggtcagac 


tggcgactca gagtcagttc 


cagaccctca 


acctctcgga 


gaacctccag 


2700 
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cagcgccctc 


tggtgtggga 


cctaatacaa 


tggctgcagg 


cggtggcgca 


ccaatggcag 


2760 


acaataacga 


aggcgccgac 


ggagtgggta 


gttcctcggg 


aaattggcat 


tgegattcca 


2820 


catggctggg 


cgacagagtc 


atcaccacca 


gcacccgaac 


ctgggccctg 


cccacctaca 


2880 


acaaccacct 


ctacaagcaa 


atctccaacg 


ggacatcggg 


aggagccacc 


aacgacaaca 


2940 


cctacttcgg 


ctacagcacc 


ccctgggggt 


attttgactt 


taacagattc 


cactgccact 


3000 


tttcaccacg 


tgactggcag 


cgactcatca 


acaacaactg 


gggattcegg 


cccaagagac 


3060 


tcagcttcaa 


gctcttcaac 


atccaggtca 


aggaggtcac 


gcagaatgaa 


ggcaccaaga 


3120 


ccatcgccaa 


taacctcacc 


agcaccatcc 


aggtgtttac 


ggacteggag 


taccagctgc 


3180 


cgtacgttct 


cggctctgcc 


caccagggct 


gcc.tgcctcc 


gttcccggcg 


gacgtgttca 


3240 


tgattcccca gtacggctac 


ctaacactca acaacggtag tcaggccgtg 


ggacgctcct 


3300 


ccttctactg cctggaatac 


utzticcttcgc 


ag at get gag 


aaccggcaac 


aacttccagt 


3360 


ttacttacac 


cttcgaggac 


gtgcctttcc 


acagcagcta 


cgcccacagc 


cagagcttgg 


3420 


accggctgat 


gaatcctctg 


att gaccagt 


acct gt ac t a 




actcaaacaa 


3480 


caggaggcac 


ggcaaatacg 


cagactctgg 


y<— /— 4— 4* r-* -~\ f~f f A* ^ 

yC ttCdyCCa 


~% r^r +■ rf n rt f /■* +* 

ayy tyyy CCt 


aatacaatgg 


3540 


ccaatcaggc 


aaagaactgg 




r» r* +* rr t~ a c cct 




gtctcaacga 


3600 


caaccgggca 


aaacaacaat 


agcaactttg 


cctggactgc 


tgggaccaaa 


taccatctga 


3660 


atggaagaaa 


ttcattggct 


aatcctggca 


tegctatgge 


aacacacaaa 


gacgacgagg 


3720 


agcgtttttt 


tcccagtaac 


gggatcctga 


tttttggcaa 


acaaaatget 


gecagagaca 


3780 


atgcggatta 


cagcgatgtc 


atgctcacca 


gcgaggaaga 


aatcaaaacc 


actaaccctg 


3840 


tggctacaga 


ggaatacggt 


atcgtggcag 


ataacttgea 


gcagcaaaac 


acggctcctc 


3900 


aaattggaac 


tgtcaacagc 


cagggggcct 


tacceggtat 


ggtctggcag 


aacegggacg 


3960 


tgtacctgca 


gggtcccatc 


tgggccaaga 


ttcctcacac 


ggaeggcaac 


ttccacccgt 


4020 


ctccgctgat 


gggcggcttt 


ggcctgaaac 


atcctccgcc 


tcagatcctg 


atcaagaaca 


4080 


cgcctgtacc 


tgcggatcct 


ccgaccacct 


tcaaccagtc 


aaagctgaac 


tctttcatca 


4140 


cgcaatacag 


caccggacag 


gtcagcgtgg 


aaattgaatg 


ggagctgcag 


aaggaaaaca 


4200 


gcaagcgctg 


gaaccccgag 


atccagtaca 


cctccaacta 


ctacaaatct 


acaagtgtgg 


4260 


actttgctgt 


taatacagaa 


ggcgtgtact 


ctgaaccccg 


ccccattggc 


acccgttacc 


4320 


tcacccgtaa 


tctgtaattg 


cctgttaatc 


aataaacegg 


ttgattcgtt 


tcagttgaac 


4380 


tttggtctct 


gcg 










4393 



<210> 5 

<211> 4385 

<212> DNA 

<213> adeno-associated virus serotype 9 
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<400> 5 



cagagaggga 


gtggccaact 


ccatcactag 


gggtaatcgc 


gaagcgcctc 


ccacgctgcc 


60 


gcgtcagcgc 


tgacgtagat 


tacgtcatag 


gggagtggtc 


ctgtattagc 


tgtcacgtga 


120 


gtgcttttgc 


gacattttgc 


gacaccacat 


ggccatttga 


ggtatatatg 


gccgagtgag 


180 


cgagcaggat 


ctccattttg 


accgcgaaat 


ttgaacgagc 


agcagccatg 


ccgggcttct 


240 


acgagattgt 


gatcaaggtg 


ccgagcgacc 


tggacgagca 


cctgccgggc 


atttctgact 


300 


cttttgtgaa 


ctgggtggcc 


gagaaggaat 


gggagctgcc 


cccggattct 


gacatggatc 


360 


ggaatctgat 


cgagcaggca 


cccctgaccg 


tggccgagaa 


gctgcagcgc 


gacttcctgg 


420 


tccaatggcg 


ccgcgtgagt 


aaggccccgg 


aggccctctt 


ctttgttcag 


ttcgagaagg 


480 


gcgagagcta 


ctttcacctg 


cacgttctgg 


tcgagaccac 


gggggtcaag 


tccatggtgc 


540 


taggccgctt 


cctgagtcag 


attcgggaga 


agctggtcca 


gaccatctac 


cgcgggatcg 


600 


agccgaccct 


gcccaactgg 


ttcgcggtga 


ccaagacgcg 


taatggcgcc 


ggcgggggga 


660 


acaaggtggt 


ggacgagtgc 


tacatcccca 


actacctcct 


gcccaagact 


cagcccgagc 


720 


tgcagtgggc 


gtggactaac 


atggaggagt 


atataagcgc 


gtgcttgaac 


ctggccgagc 


780 


gcaaacggct 


cgtggcgcag 


cacctgaccc 


acgtcagcca 


gacgcaggag 


cagaacaagg 


840 


agaatctgaa 


ccccaattct 


gacgcgcccg 


tgatcaggtc 


aaaaacctcc 


gcgcgctaca 


900 


tggagctggt 


cgggtggctg 


gtggaccggg 


gcatcacctc 


cgagaagcag 


tggatccagg 


960 


aggaccaggc 


ctcgtacatc 


tccttcaacg 


ccgcctccaa 


ctcgcggtcc 


cagatcaagg 


1020 


ccgcgctgga 


caatgccggc 


aagatcatgg 


cgctgaccaa 


atccgcgccc 


gactacctgg 


1080 


taggcccttc 


acttccggtg 


gacattacgc 


agaaccgcat 


ctaccgcatc 


ctgcagctca 


1140 


acggctacga 


ccctgcctac 


gccggctccg 


tctttctcgg 


ctgggcacaa 


aagaagttcg 


1200 


ggaaacgcaa 


caccatctgg 


ctgtttgggc 


cggccaccac 


gggaaagacc 


aacatcgcag 


12 60 


aagccattgc 


ccacgccgtg 


cccttctacg 


gctgcgtcaa 


ctggaccaat 


gagaactttc 


1320 


ccttcaacga ttgcgtcgac 


aagatggtga 


tctggtggga 


ggagggcaag 


atgacggcca 


1380 


aggtcgtgga 


gtccgccaag 


gccattctcg 


gcggcagcaa 


ggtgcgcgtg 


gaccaaaagt 


1440 


gcaagtcgtc 


cgcccagatc 


gaccccactc 


ccgtgatcgt 


cacctccaac 


accaacatgt 


1500 


gcgccgtgat 


tgacgggaac 


agcaccacct 


tcgagcacca 


gcagcctctc 


caggaccgga 


1560 


tgtttaagtt 


cgaactcacc 


cgccgtctgg 


agcacgactt 


tggcaaggtg 


acaaagcagg 


1620 


aagtcaaaga gttcttccgc 


tgggccagtg atcacgtgac 


cgaggtggcg 


catgagtttt 


1680 


acgtcagaaa 


gggcggagcc 


agcaaaagac 


ccgcccccga 


tgacgcggat 


aaaagcgagc 


1740 


ccaagcgggc 


ctgcccctca 


gtcgcggatc 


catcgacgtc 


agacgcggaa 


ggagctccgg 


1800 


tggactttgc 


cgacaggtac 


caaaacaaat 


gttctcgtca 


cgcgggcatg 


cttcagatgc 


1860 


tgcttccctg 


caaaacgtgc 


gagagaatga 


atcagaattt 


caacatttgc 


ttcacacacg 


1920 
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gggtcagaga 


ctgctcagag 


tgtttccccg gcgtgtcaga 


atctcaaccg 


gtcgtcagaa 


1980 


agaggacgta 


tcggaaactc 


tgtgcgattc atcatctgct 


ggggcgggct 


cccgagattg 


2040 


cttgctcggc 


ctgcgatctg 


gtcaacgtgg acctggatga 


ctgtgtttct 


gagcaataaa 


2100 


tgacttaaac 


caggtatggc 


tgccgatggt tatcttccag 


attggctcga 


ggacaacctc 


2160 


tctgagggca 


ttcgcgagtg 


gtgggcgctg aaacctggag 


ccccgaagcc 


caaagccaac 


2220 


cagcaaaagc 


aggacgacgg 


ccggggtctg gtgcttcctg 


gctacaagta 


cctcggaccc 


2280 


ttcaacggac 


tcgacaaggg 


ggagcccgtc aacgcggcgg 


acgcagcggc 


cctcgagcac 


2340 


ggcaaggcct 


acgaccagca 


gctgcaggcg ggtgacaatc 


cgtacctgcg 


gtataaccac 


2400 


gccgacgccg 


agtttcagga 


gcgtctgcaa gaagatacgt 


cttttggggg 


caacctcggg 


2460 


cgagcagtct 


tccaggccaa 


gaagcgggtt ctcgaacctc 


tcggtctggt 


tgaggaaggc 


2520 


gctaagacgg 


ctcctggaaa 


gaagagaccg gtagagccat 


caccccagcg 


ttctccagac 


2580 


tcctctacgg 


gcatcggcaa 


gaaaggccaa cagcccgcca 


gaaaaagact 


caattttggt 


2640 


cagactggcg 


actcagagtc 


agttccagac cctcaacctc 


tcggagaacc 


tccagcagcg 


2700 


ccctctggtg 


tgggacctaa 


tacaatggct gcaggcggtg 


gcgcaccaat 


ggcagacaat 


2760 


aacgaaggcg 


ccgacggagt 


gggtaattcc tcgggaaatt 


ggcattgcga 


ttccacatgg 


2820 


ctgggggaca 


gagtcatcac 


caccagcacc cgaacctggg 


cattgcccac 


ctacaacaac 


2880 


cacctctaca 


agcaaatctc 


caatggaaca tcgggaggaa 


gcaccaacga 


caacacctac 


2940 


tttggctaca 


gcaccccctg 


ggggtatttt gacttcaaca 


gattccactg 


ccacttctca 


3000 


ccacgtgact 


ggcagcgact 


catcaacaac aactggggat 


tccggccaaa 


gagactcaac 


3060 


ttcaagctgt 


tcaacatcca ggtcaaggag gttacgacga 


acgaaggcac 


caagaccatc 


3120 


gccaataacc 


ttaccagcac 


cgtccaggtc tttacggact 


cggagtacca 


gctaccgtac 


3180 


gtcctaggct 


ctgcccacca 


aggatgcctg ccaccgtttc 


ctgcagacgt 


cttcatggtt 


3240 


cctcagtacg 


gctacctgac 


gctcaacaat ggaagtcaag 


cgttaggacg 


ttcttctttc 


3300 


tactgtctgg 


aatacttccc 


ttctcagatg ctgagaaccg 


gcaacaactt 


tcagttcagc 


3360 


tacactttcg 


aggacgtgcc 


tttccacagc agctacgcac 


acagccagag 


tctagatcga 


3420 


ctgatgaacc 


ccctcatcga 


ccagtaccta tactacctgg 


tcagaacaca 


gacaactgga 


3480 


actgggggaa 


ctcaaacttt 


ggcattcagc caagcaggcc 


ctagctcaat 


ggccaatcag 


3540 


gctagaaact 


gggtacccgg gccttgctac cgtcagcagc 


gcgtctccac 


aaccaccaac 


3600 


caaaataaca 


acagcaactt 


tgcgtggacg ggagctgcta 


aattcaagct 


gaacgggaga 


3660 


gactcgctaa 


tgaatcctgg 


cgtggctatg gcatcgcaca 


aagacgacga 


ggaccgcttc 


3720 


tttccatcaa 


gtggcgttct 


catatttggc aagcaaggag 


ccgggaacga 


tggagtcgac 


3780 


tacagccagg 


tgctgattac 


agatgaggaa gaaattaaag 


ccaccaaccc 


tgtagccaca 


3840 
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gaggaatacg 


gagcagtggc 


catcaacaac 


caggccgcta 


acacgcaggc 


gcaaactgga 


3900 


cttgtgcata 


accagggagt 


tattcctggt 


atggtctggc 


agaaccggga 


cgtgtacctg 


3960 


cagggcccta 


tttgggctaa 


aatacctcac 


acagatggca 


actttcaccc 


gtctcctctg 


4020 


atgggtggat 


ttggactgaa 


acacccacct 


ccacagattc 


taattaaaaa 


tacaccagtg 


4080 


ccggcagatc 


ctcctcttac 


cttcaatcaa 


gccaagctga actctttcat 


cacgcagtac 


4140 


agcacgggac 


aagtcagcgt 


aaaaatcaaa 


tgggagctgc 


agaaagaaaa 


cagcaagcgc 


4200 


tggaatccag 


agatccagta 


tacttcaaac 


tactacaaat 


ctacaaatgt 


ggactttgct 


4260 


gtcaatacca 


aaggtgttta 


ctctgagcct 


cgccccattg 


gtactcgtta 


cctcacccgt 


4320 


aatttgtaat 


tgcctgttaa 


tcaataaacc 


ggttaattcg 


tttcagttga 


actttggtct 


4380 


ctgcg 












4385 



<210> 6 

<211> 4718 

<212> DNA 

<213> adeno-associated virus serotype 1 

<400> 6 



ttgcccactc 


cctctctgcg 


cgctcgctcg ctcggtgggg 


cctgcggacc 


aaaggtccgc 


60 


agacggcaga 


gctctgctct 


gccggcccca ccgagcgagc 


gagcgcgcag 


agagggagtg 


120 


ggcaactcca 


tcactagggg 


taatcgcgaa gcgcctccca cgctgccgcg 


tcagcgctga 


180 


cgtaaattac 


gtcatagggg 


agtggtcctg tattagctgt 


cacgtgagtg 


cttttgcgac 


240 


attttgcgac 


accacgtggc 


catttagggt atatatggcc gagtgagcga 


gcaggatctc 


300 


cattttgacc 


gcgaaatttg 


aacgagcagc agccatgccg 


ggcttctacg 


agatcgtgat 


360 


caaggtgccg 


agcgacctgg 


acgagcacct gccgggcatt 


tctgactcgt 


ttgtgagctg 


420 


ggtggccgag 


aaggaatggg 


agctgccccc ggattctgac 


atggatctga 


atctgattga 


480 


gcaggcaccc 


ctgaccgtgg 


ccgagaagct gcagcgcgac 


ttcctggtcc 


aatggcgccg 


540 


cgtgagtaag 


gccccggagg 


ccctcttctt tgttcagttc 


gagaagggcg 


agtcctactt 


600 


ccacctccat 


attctggtgg 


agaccacggg ggtcaaatcc 


atggtgctgg 


gccgcttcct 


660 


gagtcagatt 


agggacaagc 


tggtgcagac catctaccgc 


gggatcgagc 


cgaccctgcc 


720 


caactggttc 


gcggtgacca 


agacgcgtaa tggcgccgga 


ggggggaaca 


aggtggtgga 


780 


cgagtgctac 


atccccaact 


acctcctgcc caagactcag 


cccgagctgc 


agtgggcgtg 


840 


gactaacatg 


gaggagtata 


taagcgcctg tttgaacctg 


gccgagcgca 


aacggctcgt 


900 


ggcgcagcac 


ctgacccacg 


tcagccagac ccaggagcag 


aacaaggaga 


atctgaaccc 


960 


caattctgac 


gcgcctgtca 


tccggtcaaa aacctccgcg 


cgctacatgg 


agctggtcgg 


1020 


gtggctggtg 


gaccggggca 


tcacctccga gaagcagtgg 


atccaggagg 


accaggcctc 


1080 
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gtacatctcc 


ttcaacgccg 


cttccaactc 


gcggtcccag 


atcaaggccg 


ctctggacaa 


1140 


tgccggcaag 


atcatggcgc 


tgaccaaatc 


cgcgcccgac 


tacctggtag 


gccccgctcc 


1200 


gcccgcggac 


attaaaacca 


accgcatcta 


ccgcatcctg 


gagctgaacg 


gctacgaacc 


1260 


tgcctacgcc 


ggctccgtct 


ttctcggctg 


ggcccagaaa 


aggttcggga 


agcgcaacac 


1320 


catctggctg 


tttgggccgg 


ccaccacggg 


caagaccaac 


atcgcggaag 


ccatcgccca 


1380 


cgccgtgccc 


ttctacggct 


gcgtcaactg 


gaccaatgag 


aactttccct 


tcaatgattg 


1440 


cgtcgacaag 


atggtgatct 


ggtgggagga 


gggcaagatg 


acggccaagg 


tcgtggagtc 


1500 


cgccaaggcc 


attctcggcg 


gcagcaaggt 


gcgcgtggac 


caaaagtgca 


agtcgtccgc 


1560 


ccagatcgac 


cccacccccg 


tgatcgtcac 


ctccaacacc 


aacatgtgcg 


ccgtgattga 


1620 


cgggaacagc 


accaccttcg 


agcaccagca 


gccgttgcag 


gaccggatgt 


tcaaatttga 


1680 


actcacccgc 


cgtctggagc 


atgactttgg 


caaggtgaca 


aagcaggaag 


tcaaagagtt 


1740 


cttccgctgg 


gcgcaggatc 


acgtgaccga 


ggtggcgcat 


gagttctacg 


tcagaaaggg 


1800 


tggagccaac 


aaaagacccg 


cccccgatga 


cgcggataaa 


agcgagccca 


agcgggcctg 


1860 


cccct cagtc 


gcggatccat 


cgacgtcaga 


cgcggaagga 


gctccggtgg 


actttgccga 


1920 


caggtaccaa 


aacaaatgtt 


ctcgtcacgc 


gggcatgctt 


cagatgctgt 


ttccctgcaa 


1980 


gacatgcgag 


agaatgaatc 


agaatttcaa 


catttgcttc 


acgcacggga 


cgagagactg 


2040 


ttcagagtgc 


ttccccggcg 


tgtcagaatc 


tcaaccggtc 


gtcagaaaga 


ggacgtatcg 


2100 


gaaactctgt 


gccattcatc 


atctgctggg 


gcgggctccc 


gagattgctt 


gctcggcctg 


2160 


cgatctggtc 


aacgtggacc 


tggatgactg 


tgtttctgag 


caataaatga 


cttaaaccag 


2220 


gtatggctgc 


cgatggttat 


cttccagatt 


ggctcgagga 


caacctctct 


gagggcattc 


2280 


gcgagtggtg 


ggacttgaaa 


cctggagccc 


cgaagcccaa 


agccaaccag 


caaaagcagg 


2340 


acgacggccg 


gggtctggtg 


cttcctggct 


acaagtacct 


cggacccttc 


aacggactcg 


2400 


acaaggggga 


gcccgtcaac 


gcggcggacg 


cagcggccct 


cgagcacgac 


aaggcctacg 


2460 


accagcagct 


caaagcgggt 


gacaatccgt 


acctgcggta 


taaccacgcc 


gacgccgagt 


2520 


ttcaggagcg 


tctgcaagaa 


gatacgtctt 


ttgggggcaa 


cctcgggcga 


gcagtcttcc 


2580 


aggccaagaa 


gcgggttctc 


gaacctctcg 


gtctggttga 


ggaaggcgct 


aagacggctc 


2640 


ctggaaagaa 


acgtccggta 


gagcagtcgc 


cacaagagcc 


agactcctcc 


tcgggcatcg 


2700 


gcaagacagg 


ccagcagccc 


gctaaaaaga 


gactcaattt 


tggtcagact 


ggcgactcag 


2760 


agtcagtccc 


cgatccacaa 


cctctcggag 


aacctccagc 


aacccccgct 


gctgtgggac 


2820 


ctactacaat 


ggcttcaggc 


ggtggcgcac 


caatggcaga 


caataacgaa 


ggcgccgacg 


2880 


gagtgggtaa 


tgcctcagga 


aattggcatt 


gcgattccac 


atggctgggc 


gacagagtca 


2940 


tcaccaccag 


cacccgcacc 


tgggccttgc 


ccacctacaa 


taaccacctc 


tacaagcaaa 


3000 
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tctccagtgc 


ttcaacgggg 


gccagcaacg 


acaaccacta 


cttcggctac 


agcaccccct 


3060 


gggggtattt 


tgatttcaac 


agattccact 


gccacttttc 


accacgtgac 


tggcagegae 


3120 


tcatcaacaa 


caattgggga 


ttccggccca 


agagactcaa 


cttcaaactc 


ttcaacatcc 


3180 


aagtcaagga 


ggtcacgacg 


aatgatggcg 


tcacaaccat 


cgctaataac 


cttaccagca 


3240 


cggttcaagt 


cttctcggac 


teggagtace 


agettcegta 


cgtcctcggc 


tctgcgcacc 


3300 


agggctgcct 


ccctccgttc 


ccggcggacg 


tgttcatgat 


tccgcaatac 


ggctacctga 


3360 


cgctcaacaa 


tggcagccaa 


gccgtgggac 


gttcatcctt 


ttactgcctg 


gaatatttcc 


3420 


cttctcagat 


gctgagaacg 


ggcaacaact 


ttaccttcag 


ctacaccttt 


gaggaagtgc 


3480 


ctttccacag 


cagctacgcg 


cacagccaga 


gcctggaccg 


gctgatgaat 


cctctcatcg 


3540 


accaatacct 


gtattacctg 


aacagaactc 


aaaatcagtc 


eggaagtgee 


caaaacaagg 


3600 


acttgctgtt 


tagccgtggg tctccagctg gcatgtctgt tcagcccaaa aactggctac 


3660 


ctggaccctg 


ttatcggcag 


cagcgcgttt 


ctaaaacaaa 


aacagacaac 


aacaacagca 


3720 


attttacctg 


gactggtgct 


tcaaaatata 


acctcaatgg 


gcgtgaatcc 


atcatcaacc 


3780 


ctggcactgc 


tatggectea 


cacaaagacg 


acgaagacaa 


gttctttccc 


atgagcggtg 


.3840 


tcatgatttt 


tggaaaagag 


agegceggag 


cttcaaacac 


tgcattggac 


aatgtcatga 


3900 


ttacagacga 


agaggaaatt 


aaagecacta 


accctgtggc 


caccgaaaga 


tttgggaccg 


3960 


^tggcagtcaa 


tttccagagc 


agcagcacag 


accctgcgac 


cggagatgtg catgetatgg 


4020 


gagcattacc 


tggcatggtg 


tggcaagata gagacgtgta 


ectgeagggt 


cccatttggg 


4080 


ccaaaattcc 


tcacacagat 


ggacactttc 


acccgtctcc 


tcttatgggc 


ggctttggac 


4140 


tcaagaaccc 


gcctcctcag 


atcctcatca 


aaaacacgcc 


tgttcctgcg 


aatcctccgg 


4200 


eggagtttte 


agctacaaag 


tttgettcat 


tcatcaccca 


atactccaca 


ggacaagtga 


4260 


gtgtggaaat 


tgaatgggag 


ctgcagaaag 


aaaacagcaa 


gcgctggaat 


cccgaagtgc 


4320 


agtacacatc 


caattatgea 


aaatctgeca 


acgttgattt 


tactgtggac 


aacaatggac 


4380 


tttatactga 


gcctcgcccc 


attggcaccc 


gttaccttac 


ccgtcccctg 


taattacgtg 


4440 


ttaatcaata 


aaccggttga ttcgtttcag ttgaactttg gtctcctgtc 


cttcttatct 


4500 


tateggttae 


catggttata 


gcttacacat 


taactgcttg 


gttgegctte 


gcgataaaag 


4560 


acttaegtea 


tegggttace 


cctagtgatg gagttgecca 


ctccctctct 


gcgcgctcgc 


4620 


tegcteggtg 


gggectgegg 


accaaaggtc 


cgcagacggc 


agagctctgc 


tctgccggcc 


4680 


ccaccgagcg 


agegagegeg 


cagagaggga 


gtgggcaa 






4718 



<210> 7 

<211> 4675 

<212> DNA 

<213> adeno-associated virus serotype 2 
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<400> 7 

ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60 

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120 

gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 180 

ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 240 

gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 300 

ggtttgaacg cgcagccgcc atgccggggt tttacgagat tgtgattaag gtccccagcg 360 

accttgacgg gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg 420 

aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 480 

ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc 540 

cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 600 

tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg 660 

aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg 720 

tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 780 

ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 840 

agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 900 

cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 960 

cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1020 

aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1080 

atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1140 

tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1200 

ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 12 60 

ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg 1320 

ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct 1380 

acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg 1440 

tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1500 

tcggaggaag caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga 1560 

ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1620 

ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1680 

tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 1740 

aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1800 

gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 18 60 

agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat 1920 
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gttctcgtca 


cgtgggcatg 


aatctgatgc 


tgtttccctg 


cagacaatgc 


gagagaatga 


1980 


atcagaattc 


aaatatctgc 


ttcactcacg 


gacagaaaga 


ctgtttagag 


tgctttcccg 


2040 


tgtcagaatc 


tcaacccgtt 


tctgtcgtca 


aaaaggcgta 


tcagaaactg 


tgctacattc 


2100 


atcatatcat 


gggaaaggtg ccagacgctt gcactgcctg cgatctggtc 


aatgtggatt 


2160 


tggatgactg 


catctttgaa 


caataaatga 


tttaaatcag 


gtatggctgc 


cgatggttat 


2220 


cttccagatt 


ggctcgagga 


cactctctct 


gaaggaataa 


gacagtggtg 


gaagctcaaa 


2280 


cctggcccac 


caccaccaaa 


gcccgcagag cggcataagg acgacagcag 


gggtcttgtg 


2340 


cttcctgggt 


acaagtacct 


cggacccttc 


aacggactcg 


acaagggaga 


gccggtcaac 


2400 


gaggcagacg 


ccgcggccct 


cgagcacgta 


caaagcctac 


gaccggcagc 


tcgacagcgg 


2460 


agacaacccg 


tacctcaagt 


acaaccacgc 


cgacgcggag 


tttcaggagc 


gccttaaaga 


2520 


agatacgtct 


tttgggggca 


acctcggacg 


agcagtcttc 


caggcgaaaa 


agagggttct 


2580 


tgaacctctg 


ggcctggttg 


aggaacctgt 


taagacggct 


ccgggaaaaa 


agaggccggt 


2640 


agagcactct 


cctgtggagc 


cagactcctc 


ctcgggaacc 


ggaaaggcgg 


gccagcagcc 


2700 


tgcaagaaaa 


agattgaatt 


ttggtcagac 


tggagacgca 


gactcagtac 


ctgaccccca 


2760 


gcctctcgga 


cagccaccag cagccccctc tggtctggga actaatacga 


tggctacagg 


2820 


cagtggcgca 


ccaatggcag acaataacga gggcgccgac ggagtgggta 


attcctccgg 


2 88 0 


aaattggcat 


tgcgattcca 


catggatggg 


cgacagagtc 


atcaccacca 


gcacccgaac 


2940 


ctgggccctg 


cccacctaca 


acaaccacct 


ctacaaacaa 


atttccagcc 


aatcaggagc 


3000 


ctcgaacgac 


aatcactact 


ttggctacag 


caccccttgg 


gggtattttg 


acttcaacag 


3060 


attccactgc 


cacttttcac 


cacgtgactg 


gcaaagactc 


atcaacaaca 


actggggatt 


3120 


ccgacccaag 


agactcaact 


tcaagctctt 


taacattcaa 


gtcaaagagg 


tcacgcagaa 


3180 


tgacggtacg 


acgacgattg 


ccaataacct 


taccagcacg 


gttcaggtgt 


ttactgactc 


3240 


ggagtaccag 


ctcccgtacg 


tcctcggctc 


ggcgcatcaa 


ggatgcctcc 


cgccgttccc 


3300 


agcagacgtc 


ttcatggtgc 


cacagtatgg 


atacctcacc 


ctgaacaacg 


ggagtcaggc 


3360 


agtaggacgc 


tcttcatttt 


actgcctgga 


gtactttcct 


tctcagatgc 


tgcgtaccgg 


3420 


aaacaacttt 


accttcagct 


acacttttga 


ggacgttcct 


ttccacagca 


gctacgctca 


3480 


cagccagagt 


ctggaccgtc 


tcatgaatcc 


tctcatcgac 


cagtacctgt 


attacttgag 


3540 


cagaacaaac 


actccaagtg 


gaaccaccac 


gcagtcaagg 


cttcagtttt 


ctcaggccgg 


3600 


agcgagtgac 


attcgggacc 


agtctaggaa ctggcttcct ggaccctgtt 


accgccagca 


3660 


gcgagtatca 


aagacatctg 


cggataacaa 


caacagtgaa 


tactcgtgga 


ctggagctac 


3720 


caagtaccac 


ctcaatggca 


gagactctct 


ggtgaatccg 


gccatggcaa 


gccacaagga 


3780 


cgatgaagaa 


aagttttttc 


ctcagagcgg 


ggttctcatc 


tttgggaagc 


aaggctcaga 


3840 
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gaaaacaaat 

caatcccgtg 

acaagcagct 

cagagatgtg 

tcacccctct 

caagaacacc 

cttcatcaca 

aggaaaacag 

ttaatcgtgg 

ccagatacct 

cagttgaact 

agtagcatgg 

cctctctgcg 

gctttgcccg 



gtgaacattg 

gctacggagc 

accgcagatg 

taccttcagg 

cccctcatgg 

ccggtacctg 

cagtactcca 

caaacgctgg 

acttaccgtg 

gactcgtaat 

ttggtctctg 

cgggttaatc 

cgctcgctcg 

ggcggcctca 



aaaaggtcat 

agtatggttc 

tcaacacaca 

ggcccatctg 

gtggattcgg 

cgaatccttc 

cgggacacgg 

aatcccgaaa 

gatactaatg 

ctgtaattgc 

cgtatttctt 

attaactaca 

ctcactgagg 

gtgagcgagc 



gattacagac 

tgtatctacc 

aggcgttctt 

ggcaaagatt 

acttaaacac 

gaccaccttc 

tcagcgtgga 

ttcagtacac 

gcgtgtattc 

ttgttaatca 

tcttatctag 

aggaacccct 

ccgggcgacc 

gagcgcgcag 



gaagaggaaa 

aacctccaga 

ccaggcatgg 

ccacacacgg 

cctcctccac 

agtgcggcaa 

gatcgagtgg 

ttccaactac 

agagcctcgc 

ataaaccgtt 

tttccatggc 

agtgatggag 

aaaggtcgcc 

agagggagtg 



tcggaacaac 

gaggcaacag 

tctggcagga 

acggacattt 

agattctcat 

agtttgcttc 

gagctgcaga 

aacaagtctg 

cccattggca 

taattcgttt 

tacgtagata 

ttggccactc 

cgacgcccgg 

gccaa 



<210> 8 
<211> 4726 
<212> DNA 

<213> adeno-associated virus serotype 3 
<400> 8 

ttggccactc cctctatgcg cactcgctcg ctcggtgggg cctggcgacc aaaggtcgcc 
agacggacgt gctttgcacg tccggcccca ccgagcgagc gagtgcgcat agagggagtg 
gccaactcca tcactagagg tatggcagtg acgtaacgcg aagcgcgcga agcgagacca 
cgcctaccag ctgcgtcagc agtcaggtga cccttttgcg acagtttgcg acaccacgtg 
gccgctgagg gtatatattc tcgagtgagc gaaccaggag ctccattttg accgcgaaat 
ttgaacgagc agcagccatg ccggggttct acgagattgt cctgaaggtc ccgagtgacc 
tggacgagcg cctgccgggc atttctaact cgtttgttaa ctgggtggcc gagaaggaat 
gggacgtgcc gccggattct gacatggatc cgaatctgat tgagcaggca cccctgaccg 
tggccgaaaa gcttcagcgc gagttcctgg tggagtggcg ccgcgtgagt aaggccccgg 
aggccctctt ttttgtccag ttcgaaaagg gggagaccta cttccacctg cacgtgctga 
ttgagaccat cggggtcaaa tccatggtgg tcggccgcta cgtgagccag attaaagaga 
agctggtgac ccgcatctac cgcggggtcg agccgcagct tccgaactgg ttcgcggtga 
ccaaaacgcg aaatggcgcc gggggcggga acaaggtggt ggacgactgc tacatcccca 
actacctgct ccccaagacc cagcccgagc tccagtgggc gtggactaac atggaccagt 



3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4675 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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atttaagcgc ctgtttgaat ctcgcggagc gtaaacggct ggtggcgcag catctgacgc 900 

acgtgtcgca gacgcaggag cagaacaaag agaatcagaa ccccaattct gacgcgccgg 960 

tcatcaggtc aaaaacctca gccaggtaca tggagctggt cgggtggctg gtggaccgcg 1020 

ggatcacgtc agaaaagcaa tggattcagg aggaccaggc ctcgtacatc tccttcaacg 1080 

ccgcctccaa ctcgcggtcc cagatcaagg ccgcgctgga caatgcctcc aagatcatga 1140 

gcctgacaaa gacggctccg gactacctgg tgggcagcaa cccgccggag gacattacca 1200 

aaaatcggat ctaccaaatc ctggagctga acgggtacga tccgcagtac gcggcctccg 1260 

tcttcctggg ctgggcgcaa aagaagttcg ggaagaggaa caccatctgg ctctttgggc 1320 

cggccacgac gggtaaaacc aacatcgcgg aagccatcgc ccacgccgtg cccttctacg 1380 

gctgcgtaaa ctggaccaat gagaactttc ccttcaacga ttgcgtcgac aagatggtga 1440 

tctggtggga ggagggcaag atgacggcca aggtcgtgga gagcgccaag gccattctgg 1500 

gcggaagcaa ggtgcgcgtg gaccaaaagt gcaagtcatc ggcccagatc gaacccactc 1560 

ccgtgatcgt cacctccaac accaacatgt gcgccgtgat tgacgggaac agcaccacct 1620 

tcgagcatca gcagccgctg caggaccgga tgtttgaatt tgaacttacc cgccgtttgg 1680 

accatgactt tgggaaggtc accaaacagg aagtaaagga ctttttccgg tgggcttccg 1740 

atcacgtgac tgacgtggct catgagttct acgtcagaaa gggtggagct aagaaacgcc 1800 

ccgcctccaa tgacgcggat gtaagcgagc caaaacggga gtgcacgtca cttgcgcagc 1860 

cgacaacgtc agacgcggaa gcaccggcgg actacgcgga caggtaccaa aacaaatgtt 1920 

ctcgtcacgt gggcatgaat ctgatgcttt ttccctgtaa aacatgcgag agaatgaatc 1980 

aaatttccaa tgtctgtttt acgcatggtc aaagagactg tggggaatgc ttccctggaa 2040 

tgtcagaatc tcaacccgtt tctgtcgtca aaaagaagac ttatcagaaa ctgtgtccaa 2100 

ttcatcatat cctgggaagg gcacccgaga ttgcctgttc ggcctgcgat ttggccaatg 2160 

tggacttgga tgactgtgtt tctgagcaat aaatgactta aaccaggtat ggctgctgac 2220 

ggttatcttc cagattggct cgaggacaac ctttctgaag gcattcgtga gtggtgggct 2280 

ctgaaacctg gagtccctca acccaaagcg aaccaacaac accaggacaa ccgtcggggt 2 340 

cttgtgcttc cgggttacaa atacctcgga cccggtaacg gactcgacaa aggagagccg 2400 

gtcaacgagg cggacgcggc agccctcgaa cacgacaaag cttacgacca gcagctcaag 2460 

gccggtgaca acccgtacct caagtacaac cacgccgacg ccgagtttca ggagcgtctt 2520 

caagaagata cgtcttttgg gggcaacctt ggcagagcag tcttccaggc caaaaagagg 2580 

atccttgagc ctcttggtct ggttgaggaa gcagctaaaa cggctcctgg aaagaagggg 2640 

gctgtagatc agtctcctca ggaaccggac tcatcatctg gtgttggcaa atcgggcaaa 2700 

cagcctgcca gaaaaagact aaatttcggt cagactggag actcagagtc agtcccagac 2760 
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cctcaacctc 


tcggagaacc 


accagcagcc 


cccacaagtt 


tgggatctaa 


tacaatggct 


2820 


tcaggcggtg gcgcaccaat ggcagacaat 


aacgagggtg 


ccgatggagt 


gggtaattcc 


2880 


4* rt Kt ^ A A ^ +" 

LCayyaaat L 


ggcattgcga ttcccaatgg 


ctgggcgaca 


gagtcatcac 


caccagcacc 


2940 


ayadLLLy y y 


utu uyou^aL 




catctctaca 


agcaaatctc 


cagccaatca 


3000 


y yaLju u uu a. ct 


o.LyuUuuL>uu 


w uau u u l y y ^ 


tacagcaccc 


cttgggggta 


ttttgacttt 


3060 


aaLay aL uuu 


ui> i- y ul< u o 


rtcaccaccrt 


gactggcagc 


gactcattaa 


caacaactgg 


3120 


yyaiLOi>yy ^ 


("raaoaaart 

i» u» cm y a. aa v. 


racrcttcaaa 


ctcttcaaca 


tccaagttag 


aggggtcacg 


3180 




gcacgacgac 


tattgecaat 


aaccttacca 


geaeggttea 


agtgtttacg 


3240 


y au uuy y y 


atcadctccc 


crtaccrtcrctc 


gggteggege 


accaaggctg 


tctcccgccg 


3300 


■hUrpafTrTfrf 
LLLUtay*-yy 


arrftrttcat 


rfrrtccctcacf 

U U U W W W <w W4. U 


tatggatacc 


tcaccctgaa 


caaeggaagt 


3360 


Luayuy y l. y y 


naccrctcatc 

VJ C_4 *-4 w w CA l« w 


cfctttactcrc 


ctggagtact 


tcccttcgca 


gatgetaagg 


3420 


aL. L. y y add u d 


acttccaatt 


cagct at acc 


ttcgaggatg 


taccttttca 


cagcagctac 


3480 


octcacacrcc 

U X-* 1 \* VA (A y W W 


agagtttgga 


tegcttgatg 


aatcctctta 


ttgatcagta 


tctgtactac 


3540 




cgcaaggaac 


aac ct c tgga 


acaaccaacc 


aatcaegget 


gctttttagc 


3600 


L->ciyyi — y y y 


rtcacrtctat 


gtctttgcag 


gecagaaatt 


ggctacctgg 


gccctgctac 


3 660 


l. y y UQaL. ^ y ^* 


oactttcaaa 

Vj 1-4. * — ' \— ^ I — ^ CA (A VA 


cractcrctaac 

M >^ *•* W LA U w 


gacaacaaca 


acagtaactt 


tccttggaca 


3720 


ct r rt ctr* r 7\ ft r* ?i 


aafatratct 


caatcrcrcccrc 

s> <asa i» y y ^ y \-» 


gaetegctgg 


tgaatccagg 


accagctatg 


3780 


rrr*r , A<Hri~r , »r , *A 
y tLay luaua 


ayya.v-yoLyci 


afiaaaaat"t*t 

ca y CA CACA CACA c» l* w 


ttccctatgc 


aeggcaatet 


aatatttggc 


3840 


dday aayyy a 


raarfrnraarf 
uudLy yuaciy 


f- ^ n r* rr r* a rr a ^ 


ttagataatg 


taatgattac 


ggatgaagaa 


3900 


g ayaLuCgta 




tgtggcaaca 


gagcagtatg 


gaactgtggc 


aaataacttg 


3960 


carrart^ , t"r , a a 
L> dy ay^LUdQ 


atacagctcc 


cacgactgga 


actgtcaatc 


atcagggggc 


cttacctggc 


4020 


atggtgtggc 


aagatcgtga 


cgtgtacctt 


caaggaccta 


tctgggcaaa 


gattcctcac 


4080 


acggatggac 


actttcatcc 


ttctcctctg 


atgggaggct 


ttggactgaa 


acatccgcct 


4140 


cctcaaatca 


tgatcaaaaa 


tactceggta 


ccggcaaatc 


ctccgacgac 


tttcagcccg 


4200 


gccaagtttg 


cttcatttat 


cactcagtac 


tccactggac 


aggtcagegt 


ggaaattgag 


4260 


tgggagctac 


agaaagaaaa 


cagcaaacgt tggaatccag agattcagta cacttccaac 


4 320 


tacaacaagt ctgttaatgt ggactttact gtagacacta atggtgttta tagtgaacct 


4380 


cgccctattg 


gaacccggta 


tctcacacga 


aacttgtgaa 


tcctggttaa 


tcaataaacc 


4440 


gtttaattcg 


tttcagttga 


actttggctc 


ttgtgcactt 


ctttatcttt 


atcttgtttc 


4500 


catggctact 


gcgtagataa 


gcagcggcct 


gcggcgcttg 


cgcttcgcgg 


tttacaactg 


4560 


ctggttaata 


tttaactctc 


gccatacctc 


tagtgatgga 


gttggccact 


ccctctatgc 


4620 


gcactcgctc 


gctcggtggg 


gcctggcgac 


caaaggtege 


cagaeggacg 


tgetttgeae 


4680 
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gtccggcccc accgagcgag cgagtgcgca tagagggagt ggccaa 4726 

<210> 9 
<211> 3098 
<212> DNA 

<213> new AAV serotype, clone 42.2 
<400> 9 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcttccg 180 

cccagatcga tcccaccccc gtgatcgtca cttccaacac caacatgtgc gctgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttaca agaccggatg ttcaaatttg 300 

aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 
ggaaactctg tgccattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 
gcgatctggt caacgtggac ctggatgacc gtgtttctga gcaataaatg acttaaacca 
ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaagggag agccggtcaa cgaggcagac gccgcggccc tcgagcacga caaggcctac 1080 

gacaagcagc tcgagcaggg ggacaacccg tacctcaagt acaaccacgc cgacgccgag 1140 

tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagacccat agaatccccc gactcctcca cgggcatcgg caagaaaggc 1320 

cagcagcccg ctaaaaagaa gctcaacttt gggcagactg gcgactcaga gtcagtgccc 1380 

gacccccaac ctctcggaga acctcccgcc gcgccctcag gtctgggatc tggtacaatg 1440 

gctgcaggcg gtggcgcacc aatggcagac aataacgaag gcgccgacgg agtgggtaat 1500 

gcctccggaa attggcattg cgattccaca tggctgggcg acagagtcat caccaccagc 1560 

acccgcacct gggccctgcc cacctacaac aaccacctct acaagcagat atcaagtcag 1620 

agcggggcta ccaacgacaa ccacttcttc ggctacagca ccccctgggg ctattttgac 1680 



780 
840 
900 
960 
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ttcaacagat 


tccactgcca 


cttctcacca 


cgtgactggc 


agcgactcat 


caacaacaac 


1740 


tggggattcc 


ggcccagaaa 


gctgcggttc 


aagttgttca 


acatccaggt 


caaggaggtc 


1800 


acgacgaacg 


acggcgttac 


gaccatcgct 


aataacctta 


ccagcacgat 


tcaggtcttc 


1860 


tcggactcgg 


agtaccaact 


gccgtacgtc 


ctcggctctg 


cgcaccaggg 


ctgcctccct 


1920 


ccgttccctg 


cggacgtgtt 


catgattcct 


cagtacggat 


atctgactct 


aaacaacggc 


1980 


agtcagtctg 


tgggacgttc 


ctccttctac 


tgcctggagt 


actttccttc 


tcagatgctg 


2040 


agaacgggca 


ataactttga 


attcagctac 


acctttgagg 


aagtgccttt 


ccacagcagc 


2100 


tatgcgcaca 


gccagagcct 


ggaccggctg 


atgaatcccc 


tcatcgacca 


gtacctgtac 


2160 


tacctggccc 


ggacccagag 


cactacgggg 


tccacaaggg 


agctgcagtt 


ccatcaggct 


2220 


gggcccaaca 


ccatggccga 


gcaatcaaag 


aactggctgc 


ccggaccctg 


ttatcggcag 


228 0 


cagagactgt 


caaaaaacat 


agacagcaac 


aacaacagta 


actttgcctg 


gaccggggcc 


2340 


actaaatacc 


atctgaatgg 


tagaaattca 


ttaaccaacc 


cgggcgtagc 


catggccacc 


2400 


aacaaggacg 


acgaggacca 


gttctttccc 


atcaacggag tgctggtttt tggcgaaacg 


2460 


ggggctgcca 


acaagacaac 


gctggaaaac 


gtgctaatga 


ccagcgagga 


ggagatcaaa 


2520 


accaccaatc 


ccgtggctac 


agaagaatac 


ggtgtggtct 


ccagcaacct 


gcaatcgtct 


2580 


acggccggac 


cccagacaca 


gactgtcaac 


agccaggggg 


ctctgcccgg 


catggtctgg 


2640 


cagaaccggg 


acgtgtacct 


gcagggtccc 


atctgggcca 


aaattcctca 


cacggacggc 


2700 


aactttcacc 


cgtctcccct 


gatgggcgga 


tttggactca 


aacacccgcc 


tcctcaaatt 


2760 


ctcatcaaaa 


acaccccggt 


acctgctaat 


cctccagagg 


tgtttactcc 


tgccaagttt 


2820 


gcctcattta 


tcacgcagta 


cagcaccggc 


caggtcagcg 


tggagatcga 


gtgggaactg 


2880 


cagaaagaaa 


acagcaaacg 


ctggaatcca 


gagattcagt 


acacctcaaa 


ttatgccaag 


2940 


tctaataatg 


tggaatttgc 


tgtcaacaac 


gaaggggttt 


atactgagcc 


tcgccccatt 


3000 


ggcacccgtt 


acctcacccg 


taacctgtaa 


ttgcctgtta 


atcaataaac 


cggttaattc 


3060 


gtttcagttg 


aactttggtc 


tctgcgaagg 


gcgaattc 






3098 



<210> 10 
<211> 3098 
<212> DNA 

<213> new AAV serotype, clone 16.3 
<400> 10 

gaattcgccc ttcgcagaga ccaaagttca actgaaacga atcaaccggt ttattgatta 60 
acaagtaatt acaggttacg ggtgaggtaa cgggtgccaa tggggcgagg ctcagtataa 120 
accccttcgt tgttgacagc aaattccaca ttattagact tggcataatt tgaggtgtac 180 
tgaatctctg gattccagcg tttgctgttt tctttctgca gttcccactc gatctccacg 240 
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ctgacctggc cggtgctgta ctgcgtgata aatgaggcaa actaggcagg agtaaacacc 300 

cctggaggat tagcaggtac cggggtgttt ttgatgagaa tttgaggagg cgggtgtttg 360 

agtccaaatc cgcccatcag gggagacggg tgaaagttgc cgtccgtgtg aggaattttg 420 

gcccagatgg gaccctgcag gtacacgtcc cggttctgcc agaccatgcc gggcagagcc 480 

ccctggctgt tgacagtctg tgtctggggt ccggccgtag acgattgcag gttgctggag 540 

accacaccgt attcttctgt agccacggga ttggtggttt tgatctcctc ctcgctggtc 600 

attagcacgt tttccagcgt tgtcttgttg gcagcccccg ttttgccaaa aaccagcact 660 

ccgttgatgg gaaagaactg gccctcgtcg tccttgttgg tggccatggc tacgcccggg 720 

ttggttaatg aatttctacc attcagatgg tatttagtgg ccccggtcca ggcaaagtta 780 

ctgttgttgt tgctgtctat gttttttgac agtctctgct gccgataaca gggtccgggc 840 
agccagttct ttgattgctc ggccatggtg ttgggcccag cctgatggaa ctgcagctcc 900 
cttgtggacc ccgtagtgct ctgggtccgg gccaggtagt acaggtactg gtcgatgagg 960 

ggattcatca gccggtccag gctctggctg tgcgcatagc tgctgtggaa aggcacttcc 1020 

tcaaaggtgt agctgaattc aaagttattg cccgttctca gcatctgaga aggaaagtac 1080 

tccaggcagt agaaggagga acgtcccata gactgactgc cgttgtttag agtcagatat 1140 

ccgtactgag gaatcatgaa cacgtccgca gggaacggag ggaggcagcc ctggtgcgca 1200 

gagccgagga cgtacggcag ttggtactcc gagtccgaga agacctgaat cgtgctggta 1260 

aggttattag cgatggtcgt aacgccgtcg ttcgtcgtga cctccttgac ctggatgttg 1320 

aacaacttga accgcagctt tctgggccgg aatccccagt tgttgttgat gagtcgctgc 1380 

cagtcacgtg gtgagaagtg gcagtggaat ctgttgaagt caaaatagcc ccagggggtg 1440 

ctgtagccga agaagtggtt gtcgttggta gccccgctct gacttgatat ctgcttgtag 1500 

aggtggttgt tgtaggtggg cagggcccag gtgcgggtgc tggtggtgat gactctgtcg 1560 

cccagccatg tggaatcgca atgccaattt ccggaggcat tacccactcc gtcggcgcct 1620 

tcgttattgt ctgccattgg tgcgccaccg cctgcagcca ttgtaccaga tcccagacct 1680 

gagggcgcgg cgggaggttc tccgagaggt tgggggtcgg gcactgactc tgagtcgcca 1740 

gtctgcccaa agttgagctt ctttttagcg ggctgctggc ctttcttgcc gatgcccgtg 1800 

gaggagtcgg gggattctat gggtctcttc tttccaggag ccgtcttagc gccttcctca 18 60 

accagaccga gaggttcgag aacccgcttc ttggcctgga agactgctcg cccgaggttg 1920 

cccccaaaag acgtatcttc ttgaagacgc tcctgaaact cagcgtcggc gtggttgtac 1980 

ttgaggtacg ggttgtcccc ctgctcgagc tgcttgtcgt aggccttgtc gtgctcgagg 2040 

gccgcggcgt ctgcctcgtt gaccggctct cccttgtcga gtccgttgaa gggtccgagg 2100 
tacttgtagc caggaagcac cagaccccgg ccgtcgtcct gcttttgctg gttggctttg 2160 
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ggtttcgggg ctccaggttt caagtcccac cactcgcgaa tgccctcaga gaggttgtcc 2220 
tcgagccaat ctggaagata accatcggca gccatacctg gtttaagtca tttattgctc 



2280 



agaaacacag tcatccaggt ccacgttgac cagatcgcag gccgagcaag caatctcggg 2340 



2400 



3000 
3060 



agcccgcccc agcagatgat gaatggcaca gagtttccga tacgtcctct ttctgacgac 

cggttgagat tctgacacgc cggggaaaca ttctgaacag tctctggtcc cgtgcgtgaa 2460 

gcaaatgttg aaattctgat tcattctctc gcatgtcttg cagggaaaca gcatctgaag 2520 

catgcccgcg tgacgagaac atttgttttg gtacctgtcg gcaaagtcca ccggagctcc 2580 

ttccgcgtct gacgtcgatg gatccgcgac tgaggggcag gcccgcttgg gctcgctttt 2640 

atccgcgtca tcgggggcgg gcctcttgtt ggctccaccc tttctgacgt agaactcatg 2700 

cgccacctcg gtcacgtgat cctgcgccca gcggaagaac tctttgactt cctgctttgt 2760 

caccttgcca aagtcctgct ccagacggcg ggtgagttca aatttgaaca tccggtcttg 2820 

taacggctgc tggtgctcga aggtggtgct gttcccgtca atcacggcgc acatgttggt 2880 

gttggaagtg acgatcacgg gggtgggatc gatctgggcg gacgacttgc acttttggtc 2940 
cacgcgcacc ttgctgccgc cgagaatggc cttggcggac tccacgacct tggccgtcat 
cttgccctcc tcccaccaga tcaccatctt gtcgacgcaa tcgttgaagg gaaagttctc 

attggtccag ttgacgcagc cgtagaaagg gcgaattc 3098 

<210>. 11 
<211> 3121 
<212> DNA 

<213> new AAV serotype, clone 29.3 
<400> 11 

gaattcgccc ttcgcagaga ccaaagttca actgaaacga atcaaccggt ttattgatta 60 
acaagcaatt acagattacg ggtgaggtaa cgggtgccga tggggcgagg ctcagaataa 120 
gtgccatctg tgttaacagc aaagtccaca tttgtagatt tgtagtagtt ggaagtgtat 180 
tgaatctctg ggttccagcg tttgctgttt tctttctgca gctcccattc aatttccacg 240 
ctgacctgtc cggtgctgta ctgcgtgatg aacgacgcca gcttagcttg actgaaggta 300 
gttggaggat ccgcgggaac aggtgtattc ttaatcagga tctgaggagg cgggtgtttc 360 
agtccaaagc cccccatcag cggcgaggga tgaaagtttc cgtccgtgtg aggaatcttg 
gcccagatag gaccctgcag gtacacgtcc cggttctgcc agaccatgcc aggtaaggct 
ccttgactgt tgacggcccc tacaatagga gcggcgtttt gctgttgcag gttatcggcc 54 0 
accacgccgt actgttctgt ggccactggg ttggtggttt taatttcttc ctcactggtt 
agcataacgc tgctatagtc cacgttgcct tttccagctc cctgtttccc aaacattaag 
actccgctgg acggaaaaaa tcgctcttcg tcgtccttgt gggttgccat agcgacaccg 720 
ggatttacca gagagtctct gccattcaga tgatacttgg tggcaccggt ccaggcaaag 780 



420 
480 



600 
660 
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ttgctgttgt 


tattttgcga 


cagtgtcgtg gagacgcgtt 


gctgccggta 


gcagggcccg 


840 


ggtagccagt 


ttttggcctg 


agccgacatg ttattaggcc 


cggcctgaga 


aaatagcaac 


900 


tgctgagttc 


ctgcggtacc 


tcccgtggac tgagtccgag 


acaggtagta 


caggtactgg 


960 


tcgatgaggg 


ggttcatcag 


ccggtccagg ctttggctgt 


gcgcgtagct 


gctgtgaaaa 


1020 


ggcacgtcct 


caaactggta 


gctgaactca aagttgttgc 


ccgttctcag 


catttgagaa 


1080 


ggaaagtact 


ccaggcagta 


gaaggaggaa cggcccacgg 


cctgactgcc 


attgttcaga 


1140 


gtcaggtacc 


cgtactgagg 


aatcatgaag acgtccgccg 


ggaacggagg 


caggcagccc 


1200 


tggcgcgcag 


agccgaggac 


gtacgggagc tggtattccg 


agtccgtaaa 


gacctgaatc 


1260 


gtgctggtaa 


ggttattggc 


gatggtcttg gtgccttcat 


tctgcgtgac 


ctccttgacc 


1320 


tggatgttga 


agagcttgaa 


gttgagtctc ttgggccgga 


atccccagtt 


gttgttgatg 


1380 


agtcgctgcc 


agtcacgtgg 


tgagaagtgg cagtggaatc 


tgttaaagtc 


aaaatacccc 


1440 


cagggggtgc 


tgtagccgaa 


gtaggtgttg tcgttggtgc 


ttcctcccga 


agtcccgttg 


1500 


gagatttgct 


tgtagaggtg 


gttgttgtag gtggggaggg 


cccaggttcg 


ggtgctggtg 


1560 


gtgatgactc 


tgtcgcccag 


ccatgtggaa tcgcaatgcc 


aatttcctga 


ggaactaccc 


1620 


actccgtcgg cgccttcgtt 


attgtctgcc attggagcgc 


caccgcctgc 


agccattgta 


1680 


ccagatccca 


gaccagaggg 


gcctgcgggg ggttctccga 


ttggttgagg 


gtcgggcact 


1740 


gactctgagt 


cgccagtctg 


cccaaagttg agtctctttt 


tcgcgggctg 


ctggcctttc 


1800 


ttgccgatgc 


ccgtagtgga 


gtctggagaa cgctggggtg 


atggctctac 


cggtctcttc 


1860 


tttccaggag 


ccgtcttagc 


gccttcctca accagaccga 


gaggttcgag 


aacccgcttc 


1920 


ttggcctgga 


agactgctcg 


tccgaggttg cccccaaaag 


acgtatcttc 


ttgcagacgc 


1980 


tcctgaaact 


cggcgtcggc 


gtggttatac cgcaggtacg 


gattgtcacc 


cgctttgagc 


2040 


tgctggtcgt 


aggccttgtc 


gtgctcgagg gccgctgcgt 


ccgccgcgtt 


gacgggctcc 


2100 


cccttgtcga 


gtccgttgaa 


gggtccgagg tacttgtagc 


caggaagcac 


cagaccccgg 


2160 


ccgtcgtcct 


gcttttgctg 


gttggctttg ggcttcgggg 


ctccaggttt 


cagcgcccac 


2220 


cactcgcgaa 


tgccctcaga 


gaggttgtcc tcgagccaat 


ctggaagata 


accatcggca 


2280 


gccatacctg 


atctaaatca 


tttattgttc aaagatgcag 


tcatccaaat 


ccacattgac 


2340 


cagatcgcag gcagtgcaag cgtctggcac ctttcccatg 


atatgatgaa 


tgtagcacag 


2400 


tttctgatac 


gcctttttga 


cgacagaaac gggttgagat 


tctgacacgg 


gaaagcactc 


2460 


taaacagtct 


ttctgtccgt 


gagtgaagca gatatttgaa 


ttctgattca 


ttctctcgca 


2520 


ttgtctgcag 


ggaaacagca 


tcagattcat gcccacgtga 


cgagaacatt 


tgttttggta 


2580 


cctgtccgcg 


tagttgatcg 


aagcttccgc gtctgacgtc 


gatggctgcg 


caactgactc 


2640 


gcgcacccgt 


ttgggctcac 


• ttatatctgc gtcactgggg 


gcgggtcttt 


tcttggctcc 


2700 
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accctttttg acgtagaatt catgctccac ctcaaccacg tgatcctttg cccaccggaa 2760 

aaagtctttg acttcctgct tggtgacctt cccaaagtca tgatccagac ggcgggtgag 2820 

ttcaaatttg aacatccggt cttgcaacgg ctgctggtgt tcgaaggtcg ttgagttccc 2880 

gtcaatcacg gcgcacatgt tggtgttgga ggtgacgatc acgggagtcg ggtctatctg 2940 

ggccgaggac ttgcatttct ggtccacgcg caccttgctt cctccgagaa tggctttggc 3000 

cgactccacg accttggcgg tcatcttccc ctcctcccac cagatcacca tcttgtcgac 3060 

acagtcgttg aagggaaagt tctcattggt ccagttgacg cagccgtaga agggcgaatt 3120 
c 



3121 



<210> 12 
<211> 3121 
<212> DNA 

<213> new AAV serotype, clone 29.4 
<400> 12 

gaattcgccc ttctacggct gcgtcaactg gaccaatgag aactttccct tcaacgactg 60 
tgtcgacaag atggtgatct ggtgggagga ggggaagatg accgccaagg tcgtggagtc 120 
ggccaaagcc attctcggag gaagcaaggt gcgcgtggac cagaaatgca agtcctcggc 180 
ccagatagac ccgactcccg tgatcgtcac ctccaacacc aacatgtgcg ccgtgattga 240 
cgggaactca acgaccttcg aacaccagca gccgttgcaa gaccggatgt tcaaatttga 300 
actcacccgc cgtctggatc atgactttgg gaaggtcacc aagcaggaag tcaaagactt 360 
tttccggtgg gcaaaggatc acgtggttga ggtggagcac gaattctacg tcaaaaaggg 420 
tggagccaag aaaagacccg cccccagtga cgcagatata agtgagccca aacgggtgcg 480 
cgagtcagtt gcgcagccat cgacgtcaga cgcggaagct tcgatcaact acgcagacag 540 
gtaccaaaac aaatgttctc gtcacgcggg catgaatctg atgctgtttc cctgcagaca 600 
atgcgagaga atgaatcaga attcaaatat ctgcttcact cacggacaga aagactgttt 
agagtgcttt cccgtgtcag aatctcaacc cgtttctgtc gtcaaaaagg cgtatcagaa 
actgtgctac attcatcata tcatgggaaa ggtgccagac gcttgcactg cctgcgatct 
ggtcgatgtg gatttggatg actgcatctt tgaacaataa atgatttaaa tcaggtatgg 
ctgccgatgg ttatcttcca gattggctcg aggacaacct ctctgagggc attcgcgagt 900 
ggtgggcgct gaaacctgga gccccgaagc ccaaagccaa ccagcaaaag caggacggcg 960 
gccggggtct ggtgcttcct ggctacaagt acctcggacc cttcaacgga ctcgacaagg 1020 
gggagcccgt caacgcggcg gacgcagcgg ccctcgagca cgacaaggcc tacgaccagc 1080 
agctcaaagc gggtgacaat ccgtacctgc ggtataacca cgccgacgcc gagtttcagg 114 0 
agcgtctgca agaagatacg tcttttgggg gcaacctcgg gcgagcagtc ttccaggcca 1200 



660 
720 
780 
840 
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agaagcgggt 


tctcgaacct 


ctcggtctgg 


ttgaggaagg 


cgctaagacg 


gctcctggaa 


1260 


agaagagacc 


ggtagagcca 


tcaccccagc 


gttctccaga 


ctcctctacg 


ggcatcggca 


1320 


agaaaggcca 


gcagcccgcg 


aaaaagagac 


tcaactttgg 


gcagactggc 


gactcagagt 


1380 


cagtgcccga 


ccctcaacca 


atcggagaac 


cccccgcagg 


cccctctggt 


ctgggatctg 


1440 


gtacaatggc 


tgcaggcggt 


ggcgctccaa 


tggcagacaa 


taacgaaggc 


gccgacggag 


1500 


tgggtagttc 


ctcaggaaat 


tggcattgcg 


attccacatg 


gctgggcgac 


tgagtcatca 


1560 


ccaccagcac 


ccgaacctgg 


gccctcccca 


cctacaacaa 


ccacctctac 


aagcaaatct 


1620 


ccaacgggac 


ttcgggagga 


agcaccaacg 


acaacaccta 


cttcggctac 


agcaccccct 


1680 


gggggtattt 


tgactttaac 


agattccact 


gccacttctc 


accacgtgac 


tggcagcgac 


1740 


tcatcaacaa 


caactgggga 


ttccggccca 


agagactcaa 


cttcaagctc 


ttcaacatcc 


1800 


aggtcaagga 


ggtcacgcag 


aatgaaggca 


ccaagaccat 


cgccaataac 


cttaccagca 


1860 


cgattcaggt 


ctttacggac 


tcggaatacc 


agctcccgta 


cgtcctcggc 


tctgcgcacc 


1920 


agggctgcct 


gcctccgttc 


ccggcggacg 


tcttcatgat 


tcctcagtac 


gggtacctga 


1980 


ctctgaacaa 


tggcagtcag 


gccgtgggcc 


gttcctcctt 


ctactgcctg 


gagtactttc 


2040 


cttctcaaat 


gctgagaacg 


ggcaacaact 


ttgagttcag 


ctaccagttt 


gaggacgtgc 


2100 


cttttcacag 


cagctacgcg 


cacagccaaa 


gcctggaccg 


gctgatgaac 


cccctcatcg 


2160 


accagtacct 


gtactacctg 


tctcggactc 


agtccacggg 


aggtaccgca 


ggaactcagc 


2220 


agttgctatt 


ttctcaggcc 


gggcctaata 


acatgtcggc 


tcaggccaaa 


aactggctac 


2280 


ccgggccctg 


ctaccggcag 


taacgcgtct 


ccacgacact 


gtcgcaaaat 


aacaacagca 


2340 


actttgtctg 


gaccggtgcc 


accaagtatc 


atctgaatgg 


cagagactct 


ctggtagatc 


2400 


ccggtgtcgc 


tatggcaacc 


cacaaggacg 


acgaagagcg 


attttttccg 


tccagcggag 


2460 


tcataatgtt 


tgggaaacag 


ggagctggaa 


aagacaacgt 


ggactatagc 


agcgtcatgc 


2520 


taaccagtga 


ggaagaaatt 


aaaaccacca 


acccagtggc 


cacagaacag 


tacggcgtgg 


2580 


tggccgataa 


cctgcaacag 


caaaacgccg 


ctcctattgt 


aggggccgtc 


aacagtcaag 


2640 


gagccttacc 


tggcatggtc 


tggcagaacc 


gggacgtgta 


cctgcagggt 


cctacctggg 


2700 


ccaagattcc 


tcacacggac 


ggaaactttc 


atccctcgcc 


gctgatggga 


ggctttggac 


2760 


tgaaacaccc 


gcctcctcag 


atcctgatta 


agaatacacc 


tgttcccgcg 


gatcctccaa 


2820 


ctaccttcag 


tcaagctaag 


ctggcgtcgt 


tcatcacgca 


gtacagcacc 


ggacaggtca 


2880 


gcgtggaaat 


tgaatgggag 


ctgcaggaag 


aaaacagcaa 


acgctggaac 


ccagagattc 


2940 


aatacacttc 


caactactac 


aaatctacaa 


atgtggactt 


tgctgttaac 


acagatggca 


3000 


cttattctga 


gcctcgcccc 


atcggcaccc 


gttacctcac 


ccgtaatctg 


taattgcttg 


3060 


ttaatcaata 


aaccggttga 


ttcgtttcag 


ttgaactttg gtctctgcga agggcgaatt 


3120 
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c 3121 

<210> 13 
<211> 3121 
<212> DNA 

<213> new AAV serotype, clone 29.5 
<400> 13 

gaattcgccc ttcgcgagac caaagttcaa ctgaaacgaa tcaaccggtt tattgattaa 60 

caagcaatta cagattacgg gtgaggtaac gggtgccgat ggggcgaggc tcagaataag 120 

tgccatctgt gttaacagca aagtccacat ttgtagattt gtagtagttg gaagtgtatt 180 

gaatctctgg gttccagcgt ttgctgtttt ctttctgcag ctcccattca atttccacgc 240 

tgacctgtcc ggtgctgtac tgcgtgatga acgacgccag cttagcttga ctgaaggtag 300 

ttggaggatc cgcgggaaca ggtgtattct taatcaggat ctgaggaggc gggtgtttca 360 

gtccaaagcc tcccatcagc ggcgagggat gaaagtttcc gtccgtgtga ggaatcttgg 420 

cccagatagg accctgcagg tacacgtccc ggttctgcca gaccatgcca ggtaaggctc 480 

cttgactgtt gacggcccct acaataggag cggcgttttg ctgttgcagg ttatcggcca 540 

ccacgccgta ctgttctgtg gccactgggt tggtggtttt aatttcttcc tcactggtta 600 

gcataacgct gctatagtcc acgttgtctt ttccagctcc ctgtttccca aacattaaga . 660 

ctccgctgga cggaaaaaat cgctcttcgt cgtccttgtg ggttgccata gcgacaccgg 720 

gatttaccag agagtctctg ccattcagat gatacttggt ggcaccggtc caggcaaagt ' 780 

tgctgttgtc attttgcgac agtgtcgtgg agacgcgttg ctgccggtag cagggcccgg 84 0 

gtagccagtt tttggcctga gccgacatgt tattaggccc ggcctgagaa aatagcaact 900 

gctgagttcc tgcggtacct cccgtggact gagtccgaga caggtagtac aggtactggt 960 

cgatgagggg gttcatcagc cggtccaggc tttggctgtg cgcgtagctg ctgtgaaaag 1020 

gcacgtcctc aaactggtag ctgaactcaa agttgttgcc cgttctcagc atttgagaag 108 0 

gaaagtactc caggcagtag aaggaggaac ggcccacggc ctgactgcca ttgttcagag 114 0 

tcaggtaccc gtactgagga atcatgaaga cgtccgccgg gaacggaggc aggcagccct 1200 

ggtgcgcaga gccgaggacg tacgggagct ggtattccga gtccgtaaag acctgaatcg 1260 

tgctggtaag gttattggcg atggtcttgg tgccttcatt ctgcgtgacc tccttgacct 1320 

ggatgttgaa gagcttgaag ttgaggctct tgggccggaa tccccagttg ttgttgatga 1380 

gtcgctgcca gtcacgtggt gagaagtggc agtggaatct gttaaagtca aaataccccc 1440 

agggggtgct gtagccgaag taggtgttgt cgttggtgct tcctcccgaa gtcccgttgg 1500 

agatttgctt gtagaggtgg ttgttgtagg tggggagggc ccaggttcgg gtgctggtgg 1560 

tgatgactcc gtcgcccagc catgtggaat cgcaatgcca atttcctgag gaactaccca 1620 

ctccgtcggc gccttcgtta ttgtctgcca ttggagcgcc accgcctgca gccattgtac 1680 
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cagatcccag 


accagagggg 


cctgcggggg 


gttctccgat 


tggttgaggg 


tcgggcactg 


1740 


actctgagtc 


gccagtctgc 


ccaaagttga 


gtctcttttt 


cgcgggctgc 


tggcctttct 


1800 


tgccgatgcc 


cgtagaggag 


tctggagaac 


gctggggtga 


tggctctacc 


ggtctcttct 


1860 


ttccaggagc 


cgtcttagcg 


ccttcctcaa 


ccagaccgag 


aggttcgaga 


acccgcttct 


1920 


tggcctggaa 


gactgctcgc 


ccgaggttgc 


ccccaaaaga 


cgtatcttct 


tgcagacgct 


1980 


cctgaaactc 


ggcgtcggcg 


tggttatacc 


gcaggtacgg 


attgtcaccc 


gctttgagct 


2040 


gctggtcgta 


ggccttgtcg 


tgctcgaggg 


ccgctgcgtc 


cgccgcgttg 


acgggctccc 


2100 


ccttgtcgag 


tccgttgaag 


ggtccgaggt 


acttgtagcc 


aggaagcacc 


agaccccggc 


2160 


cgtcgtcctg 


cttttgctgg 


ttggctttgg 


gcttcggggc 


tccaggtttc 


agcgcccacc 


2220 


actcgcgaat 


gccctcagag 


aggttgtcct 


cgagccaatc 


tggaagataa 


ccatcggcag 


2280 


ccatacctga 


tttaaatcat 


ttattgttca 


aagatgcagt 


catccaaatc 


cacattgacc 


2340 


agatcgcagg 


cagtgcaagc 


gtctggcacc tttcccatga 


tatgatgaat 


gtagcacagt 


2400 


ttctgatacg 


cctttttgac 


gacagaaacg 


ggttgagatt 


ctgacacggg 


aaagcactct 


2460 


aaacagtctt 


tctgtccgtg 


agtgaagcag 


atatttgaat 


tctgattcat 


tctctcgcat 


2520 


tgtctgcagg 


gaaacagcat 


cagattcatg 


cccacgtgac 


gagaacattt 


gttttggtac 


2580 


ctgtctgcgt 


agttgatcga 


agcttccgcg 


tctgacgtcg 


atggctgcgc 


aactgactcg 


2640 


cgcacccgtt 


tgggctcact 


tatatctgcg 


tcactggggg 


cgggtctttt 


cttggctcca 


2700 


ccctttttga 


cgtagaattc 


atgctccacc 


tcaaccacgt 


gatcctttgc 


ccaccggaaa 


2760 


aagtctttga 


cttcctgctt 


ggtgaccttc 


ccaaagtcat 


gatccagacg 


gcgggtgagt 


2820 


tcaaatttga 


acatccggtc 


ttgcaacggc 


tgctggtgtt 


cgaaggtcgt 


tgagttcccg 


2880 


tcaatcacgg 


cgcacatgtt 


ggtgttggag 


gtgacgatca 


cgggagtcgg 


gtctatctgg 


2940 


gccgaggact 




gtccacgcgc 


accttgcttc 




rrrfr , t"t"'t"n'rrrT 
y y u l. l uy y ui^ 


3000 


gactccacga 


ccttggcggt 


catcttcccc 


tcctcccacc 


agatcaccat 


cttgtcgaca 


3060 


cagtcgttga 


agggaaagtt 


ctcattggtc 


cagttgacgc 


agccgtagaa 


agggcgaatt 


3120 


c 












3121 



<210> 14 
<211> 3131 
<212> DNA 

<213> new AAV serotype, clone 1-3 
<400> 14 

gcggccgcga attcgccctt ggctgcgtca actggaccaa tgagaacttt cccttcaatg 60 
attgcgtcga caagatggtg atctggtggg aggagggcaa gatgacggcc aaggtcgtgg 120 
agtccgccaa ggccattctc ggcggcagca aggtgcgcgt ggaccaaaag tgcaagtcgt 180 
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ccaacgggac 


atcgggagga 


gccaccaacg 


auaauaLL L a 


r "t" +• r rr rr r t" a r 

L» L L L y y L L CLL 


1680 


agcaccccct 


gggggtattt 


tgactttaac 


agattccact 


ttccacctttr 


accaccftaac 


1740 


tggcagcgac 


tcatcaacaa caactgggga ttccgaccca 


a fragrant" fact 


r *r t* r a a crc t" c 

L> L LL aa y L- LL 


1800 


ttcaacatcc 


aggtcaagga 


ggtcacgcag 


aatgaaggca 


era arrarra t* 

L>L>CLCiyClL>L>CLL 


r<trraafaar 
l y l l a a l a a l 


1860 


ctcaccagca 


ccatccaggt 


gtttacggac 


tcggagtacc 


agetgeegta 


cgttctcggc 


1920 


tctgtccacc 


agggctgcct 


gcctccgttc 


ccggcggacg 


tgttcatgat 


tccccagtac 


1980 


ggctacctaa 


cactcaacaa 


cggtagtcag 


gccgtgggac 


gctcctcctt 


ctactgcctg 


2040 


gaatactttc 


cttcgcagat 


gctgagaacc 


ggcaacaact 


tccagtttac 


ttacaccttc 


2100 
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gaggacgtgc ctttccacag cagctacgcc cacagctaga gcttggaccg gctgatgaat 2160 

cctctgattg accagtacct gtactacttg tctcggactc aaacaacagg aggcacggca 2220 

aatacgcaga ctctgggctt cagccaaggt gggcctaata caatggccaa tcaggcaaag 2280 

aactggctgc caggaccctg ttaccgccaa. caacgcgtct caacgacaac cgggcaaaac 2340 

aacaatagca actttgcctg gactgctggg accaaatacc atctgaatgg aagaaattca 2400 

ttggctaatc ctggcatcgc tatggcaaca cacaaagacg acgaggagcg tttttttccc 2460 

agtaacggga tcctgatttt tggcaaacaa aatgctgcca gagacaatgc ggattacagc 2520 

gatgtcatgc tcaccagcga ggaagaaatc aaaaccacta accctgtggc tacagaggaa 2580 

tacggtatcg tggcagataa cttgcagcag caaaacacgg ctcctcaaat tggaactgtc 2640 

aacagccagg gggccttacc cggtatggtc tggcagaacc gggacgtgta cctgcagggt 2700 

cccatctggg ccaagattcc tcacacggac ggcaacttcc acccgtctcc gctgatgggc 2760 

ggctttggcc tgaaacatcc tccgcctcag atcctgatca agaacacgcc tgtacctgcg 2820 

gatcctccga ccaccttcaa ccagtcaaag ctgaactctt tcatcacgca atacagcacc 2880 

ggacaggtca gcgtggaaat tgaatgggag ctgcagaagg aaaacagcaa gcgctggaac 2 940 

cccgagatcc agtacacctc caactactac aaatctataa gtgtggactt tgctgttaat 3000 

acagaaggcg tgtactctga accccgcccc attggcaccc gttacctcac ccgtaatctg 3060 

I 

taattgcctg ttaatcaata aaccggttga ttcgtttcag ttgaactttg gtctctgcga 3120 

agggcgaatt c 3131 

<210> 15 
<211> 3127 
<212> DNA 

<213> new AAV serotype, clone 13-3b 
<400> 15 

gcggccgcga attcgccctt cgcagagacc aaagttcaac tgaaacgaat caaccggttt 60 

attgattaac atgcaattac agattacggg tgaggtaacg agtgccaata gggcgaggct 120 

cagagtaaac accctggctg tcaacggcaa agtccacacc agtctgcttt tcaaagttgg 180 

aggtgtactg aatctccggg tcccagcgct tgctgttttc cttctgcagc tcccactcga 240 

tttccacgct gacttgtccg gtgctgtact gtgtgatgaa cgaagcaaac ttggcaggag 300 

taaacacctc cggaggatta gcgggaacgg gagtgttctt gatcaggatc tgaggaggcg 360 

gatgtttaag tccaaagccg cccatcaaag gagacgggtg aaagttgcca tccgtgtgag 420 

gaatcttggc ccagatggga ccctgcaggt acacgtcccg gttctgccag accatgccag 480 

gtaaggctcc ctggttgttg acaacttgtg tctgggctgc agtattagcc gcttgtaagt 540 

tgctgctgac tatcccgtat tcttccgtgg ctacaggatt agtaggacga atttcttctt 600 

catttgtcat taacacattt tccaatgtag ttttgttagt tgctccagtt tttccaaaaa 660 
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tcaggactcc gctggatggg aaaaagcggt cctcgtcgtc cttgtgagtt gccatggcga 720 
cgccgggatt aaccaacgag tttctgccgt tcaggtgata tttggtggca ccagtccaag 780 
caaagttgct gttgttgttt tgatccagcg ttttggagac cctttgttgc cggaagcagg 840 
gtccaggtaa ccaattcttg gcttgttcgg ccatagttga aggcccgccc tggtaaaact 900 
gcagttcccg attgccagct gtgcctcctg ggtcactctg tgttctggcc aggtagtaca 960 

agtactggtc gatgagggga ttcatcagcc ggtccaggct ctggctgtgt gcgtagctgc 1020 

tgtggaaagg cacgtcctcg aagctgtagc tgaactcaaa gttgttgccc gttctcagca 1080 

tctgagaggg gaagtactcc aggcagtaga aggaggaacg tcccacagac tgactgccat 1140 

tgttgagagt caggtagccg tactgaggaa tcatgaagac gtccgccggg aacggaggca 1200 

ggcagccctg gtgcgcagag ccgaggacgt acggcagctg gtattccgag tccgagaata 12 60 

cctgaatcgt gctggtaagg ttattagcga tggtcgtaac gccgtcattc gtcgtgacct 1320 

ccttgacctg gatgttgaag agcttgaacc gcagcttctt gggccggaat ccccagttgt 1380 

tgttgatgag tcgctgccag tcacgtggtg agaagtggca gtggaatctg ttaaagtcaa 1440 

aataccccca gggggtgctg tagccgaagt aggtgttgtc gttggtacta cctgcagttt 1500 

cactggagat ttgctcgtag aggtggttgt tgtaggtggg cagggcccag gttcgggtgc 1560 

tggtggtaat gactctgtcg cccagccatg tggaatcgca atgccaattt cctgaggcat 1620 

tacccactcc gtcggcacct tcgttattgt ctgccattgg tgcgccaccg cctgcagcca 1680 

ctgtaccaga tcccacacta gagggcgctg ctggaggttc tccgagaggt tgagggtcgg 1740 

ggactgactc tgagtcgcca gtctgaccga aattgagtct ctttctggcg ggctgctggc 1800 

ccttcttgcc gatgcccgtg gaggagtcgg gggaacgctg aggtgacggc tctaccggtc 1860 

tcttctttgc aggagccgtc ttagcgcctt cctcaaccag accgagaggt tcgagaaccc 1920 

gcttcttggc ctggaagact gctcgcccga ggttgccccc aaatgacgta tcttcttgca 1980 

gacgctcctg aaactcggcg tcggcgtggt tataccgcag gtacgggttg tcacccgcat 2040 

tgagctgctg gtcgtaggcc ttgtcgtgct cgagggccgc tgcgtccgcc gcgttgacgg 2100 

gctccccctt gtcgagtccg ttgaagggtc cgaggtactt gtagccagga agcaccagac 2160 

cccggccgtt gtcctgcttt tgctggttgg ctttgggttt cggggctcca ggtttcaggt 2220 
cccaccactc gcgaatgccc tcagagaggt tgtcctcgag ccaatctgga agataaccat 2280 
cggcagccat acctgattta aatcatttat tgttcaaaga tgcagtcatc caaatccaca 2340 
ttgaccagat cgcaggcagt gcaagcgtct ggcacctttc ccatgatatg atgaatgtag 2400 
cacagtttct gatacgcctt tttgacgaca gaaacgggtt tagattctga cacgggaaag 24 60 
cactctaaac agtctttctg tccgtgagtg aagcagatat ttgaattctg attcattctc 2520 
tcgcattgtc tgcagggaaa cagcatcaga ttcatgccca cgtgacgaga acatttgttt 2580 

33 



WO 03/042397 



PCT/US02/33629 



tggtacctgt ctgcgtagtt gatcgaagct tccgcgtctg acgtcgatgg ctgcgcaact 2640 

gactcgcgca cccgtttggg ctcacttata tctgcgtcac tgggggcggg tcttttcttg 2700 

gctccaccct ttttgacgta. gaattcatgc tccacctcaa ccacgtaatc ctttgcccac 27 60 

cggaaaaagt ctttgacttc ctgcttggtg accttcccaa agtcatgatc cagacggcgg 2820 

gtgagttcaa atttgaacat ccggtcttgc aacggctgct ggtgttcgaa ggtcgttgag 2880 

ttcccgtcga tcacggcgca catgttggtg ttggagatga cgatcgcggg agtcgggtct 2940 

atctgggccg aggacttgca tttctggtcc acgcgcacct tgcttcctcc gagaatggct 3000 

ttggccgact ccacgacctt ggcggtcatc ttcccctcct cccaccagat caccatcttg 3060 

tcgacacagt cgttgaaggg aaagttctca ttggtccagt tgacgcagcc gtagaaaggg 3120 

cgaattc 3127 

<210> 16 
<211> 3106 
<212> DNA 

<213> new AAV serotype, clone 24-1 
<400> 16 

gcggccgcga attcgccctt cgcagagacc aaagttcaac tgaaacgaat caaccggttt 60 

attgattaac aagtaattac aggttacggg tgaggtaacg ggtgccaatg gggcgaggct 120 

cagtataaac cccttcgttg ttgacagcaa attccacatt attagacttg gcataatttg 180 

aggtgtactg aatctctgga ttccagcgtt tgctgttttc tttctgcagt tcccactcga 240 

tctccacgct gacctggccg gtgctgtact gcgtgataaa tgaggcaaac ttggcaggag 300 

taaacacctc tggaggatta gcaggtaccg gggtgttttt gatgagaatt tgaggaggcg 360 

ggtgtttgag tccaaatccg cccatcaggg gagacgggtg aaagttgccg tccgtgtgag 420 

gaattttggc ccagatggga ccctgcaggc acacgtcccg gttctgccag accatgccgg 48 0 

gcagagcccc ctggctgttg acagtctgtg tctggggtcc ggccgtagac gattgcaggt 540 

tgctggagac cacaccgtat tcttctgtag ccacgggatt ggtggttttg atctcctcct 600 

cgctggtcat tagcacgttt tccagcgttg tcttgttggc agcccccgtt ttgccaaaaa 660 

ccagcactcc gttgatggga aagaactggt cctcgtcgtc cttgttggtg gccatggcta 720 

cgcccgggtt ggttaatgaa tttctaccat tcagatggta tttagtggcc ccggtccagg 780 

caaagttact gttgttgttg ctgtctatgt tttttgacag tctctgctgc cgataacagg 840 

gtccgggcag ccagttcttt gattgctcgg ccatggtgtt gggcccagcc tgatggaact 900 

gcagctccct tgtggacccc gtagtgctct gggtccgggc caggtagtac aggtactggt 960 

cgatgagggg attcatcagc cggtctaggc tctggctgtg cacatagctg ctgtggaaag 1020 

gcacttcctc aaaggtgtag ctgaattcaa agttattgcc cgttctcagc atctgagaag 1080 
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gaaagtactc caggcagtag aaggaggaac gtcccacaga ctgactgccg ttgtttagag 1140 

tcagatatcc gtactgagga atcatgaaca cgtccgcagg gaacggaggg aggcagccct 1200 

ggtgcgcaga gccgaggacg tacggcagtt ggtactccga gtccgagaag acctgaatcg 1260 

tgctggtaag gttattagcg atggtcgtaa cgccgtcgtt cgtcgtgacc tccttgacct 1320 

ggatgttgaa caacttgaac cgcagctttc tgggccggaa tccccagttg ttgttgatga 1380 

gtcgctgcca gtcacgtggt gagaagtggc agtggaatct gttgaagtca aaatagcccc 1440 

agggggtgct gtagctgaag aagtggttgt cgttggtagc cccgctctga cttgatatct 1500 

gcttgtagag gtggttgttg taggtgggca gggcccaggt gcgggtgctg gtggtgatga 1560 

ctctgtcgcc cagccatgtg gaatcgcaat gccaatttcc ggaggcatta cccactccgt 1620 

cggcgccttc gttattgtct gccattggtg cgccaccgcc tgcagccatt gtaccagatc 1680 

ccagacctga gggcgcggcg ggaggttctc cgagaggttg ggggtcgggc actgactctg 1740 

agtcgccagt ctgcccaaag ttgagcttct ttttagcggg ctgctggcct ttcttgccga 1800 

tgcccgtgga ggagtcgggg gattctatgg gtctcttctt tccaggagcc gtcttagcga 1860 

cttcctcaac cagaccgaga ggttcgagaa cccgcttctt ggcctggaag actgctcgcc 1920 

cgaggttgcc cccaaaagac gtatcttctt gaagacgctc ctgaaactcg gcgtcggcgt 1980 

ggttgtactt gaggtacggg ttgtccccct gctcgagctg cttgtcgtag gccttgtcgt 2040 

gctcgagggc cgcggcgtct gcctcgttga ccggctctcc cttgtcgagt ccgttgaagg 2100 

gtctgaggta cttgtagcca ggaagcacca gaccccggcc gtcgtcctgc ttttgctggt 2160 

tggctttggg tttcggggct ccaggtttca agtcccacca ctcgcgaatg ccctcagaga 2220 

ggttgtcctc gagccaatct ggaagataac catcggcagc catacctggt ttaagtcatt 2280 

tattgctcag aaacacagtc atccaggtcc acgttgacca gatcgcaggc cgagcaagca 2340 

atctcgggag cccgccccag cagatgatga atggcacaga gtttccgata cgtcctcttt 2400 

ctgacgaccg gttgagattc tgacacgccg gggaaacatt ctgaacagtc tctggtcccg 2460 

tgcgtgaagc aaatgttgaa attctgattc actctctcgc atgtcttgca gggaaacagc 2520 

atctgaagca tgcccgcgtg acgagaacat ttgttttggt acctgtcggc aaagtccacc 2580 

ggagctcctt ccgcgtctga cgtcgatgga ttcgcgactg aggggcaggc ccgcttgggc 2640 

tcgcttttat ccgcgtcatc gggggcgggt ctcttgttgg ccccaccctt tctgacgtag 2700 

aacccatgcg ccacctcggt cacgtgatcc tgcgcccagc ggaagaacct tttgacttcc 27 60 

tgctttgtca ccttgccaaa gttatgctcc agacggcggg tgggttcaaa tttgaacatc 2820 

cggtcctgca acggctgctg gtgctcgaag gtggcgctgt tcccgtcaat cacggcgcac 2880 

atgttggtgt tggaggtgac ggtcacgggg gtggggtcga tctgggcgga cgacttgcac 2940 

ttttggtcca cgcgcacctt gctgccgccg agaatggcct tggcggactc cacgaccttg 3000 
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gccgtcatct tgccctcctc ccaccagatc accatcttgt cggcgcaatc gttgaaggga 3060 

aagttctcat tggtccagtt gacgcagccg tagaaagggc gaattc 3106 

<210> 17 
<211> 3102 
<212> DNA 

<213> new AAV serotype, clone 27-3 
<400> 17 

gcggccgcga attcgccctt cgcagagacc aaagttcaac tgaaacgaat caaccggttt 60 

attgattaac aagtaattac aggttacggg tgaggtaacg ggtgccaatg gggcgaggct 120 

cagtataaac cccttcgttg ttgacagcaa attccacatt attagacttg gcataatttg 180 

aggtgtactg aatctctgga ttccagcgtt tgctgttttc tttctgcagt tcccactcga 240 

tctccacgct gacctggccg gtgctgtact gcgtgataaa tgaggcaaac ttggcaggag 300 

taaacacctc tggaggatta gcaggtaccg gggtgttttt gatgagaatt tgaggaggcg 360 

ggtgtttgag tccaaatccg cccatcaggg gagacgggtg aaagttgccg tccgtgtgag 420 

gaatttcggc ccagatggga ccctgcaggt acacgtcccg gttctgccag accatgccgg 480 

gcagagcccc ctggctgttg acagtctgtg tccggggtcc ggccgtagac gattgcaggt 540 

tgctggagac cacaccgtat tcttctgtag ccacgggatt ggtggttttg atctcctcct 600 

cgctggtcat tagcacgttt tccagcgttg tcttgttggc agcccccgtt ttgccaaaaa • 660 

ccagcactcc gttgatggga aggaactggt cctcgtcgtc cttgttggtg gccatggcta 720 

cgcccgggtt ggttaatgaa tttctaccat tcagatggta tttagtggcc ccggtccagg 780 

caaagttact gttgttgttg ctgtctatgt tttttgacag tctctgctgc cgataacagg 840 

gtccgggcag ccagttcttt gattgctcgg ccacggtgtt gggcccagcc tgatggaact 900 

gcagctccct tgtggacccc gtagtgctct gggtccgggc caggtagtac aggtactggt 960 

cgatgagggg attcatcagc cggtccaggc tctggctgtg cgcatagctg ctgtggaaag 1020 

gcacttcctc aaaggtgtag ctgaattcaa agttattgcc cgttctcagc atctgagaag 1080 

gaaagtactc caggcagcag aaggaggaac gtcccacaga ctgactgccg ttgtttagag 1140 

tcagatatcc gtactgagga atcatgaaca cgtccgcagg gaacggaggg aggcagccct 1200 

ggtgcgcaga gccgaggacg tacggcagtt ggtactccga gtccgagaag acctgaatcg 1260 

tgctggtaag gttattagcg atggtcgtaa cgccgtcgtt cgtcgtgacc tccttgacct 1320 

ggatgttgaa caacttgaac cgcagctttc tgggccggaa tccccagttg ttgttgatga 1380 

gtcgctgcca gtcacgtggt gagaagtggc agtggaatct gttgaagtca aaatagcccc 1440 

agggggtgct gtagccgaag aagtggttgt cgttggtagc cccgctctga cttgatatct 1500 

gcttgtagag gtggttgttg taggtgggca gggcccaggt gcgggtgctg gtggtgatga 1560 

ctctgtcgcc cagccatgtg gaatcgcaat gccaatttcc ggaggcatta cccactccgt 1620 
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cggcgccttc gttattgtct gccattggtg cgccaccgcc tgcagccatt gtaccagatc 1680 

ccagacctga gggcgcggcg ggaggttctc cgagaggttg ggggtcgggc actgactctg 1740 

agtcgccagt ctgcccaaag ttgagcttct ttttagcggg ctgctggcct ttcttgccga 1800 

tgcccgtgga ggagtcgggg gattctatgg gtctcttctt tccggaagcc gtcttagcgc 1860 

cttcctcaac cagaccgaga ggttcgagaa cccgcttctt ggcctggaag actgctcgcc 1920 

cgaggttgcc cccaaaagac gtatcttctt gaagacgctc ctgaaactcg gcgtcggcgt 1980 

ggttgtactt gaggtacggg ttgtccccct gctcgagctg cttgtcgtag gccttgtcgt 2040 

gctcgagggc cgcggcgtct gcctcgttga ccggctctcc cttgtcgagt ccgttgaagg 2100 

gtccgaggta cttgtagcca ggaagcacca gaccccggcc gtcgtcctgc ttttgctggt 2160 

tggctttggg tttcggggct ccaggtttca agtcccacca ctcgcgaatg ccctcagaga 2220 

ggttgtcctc gagccaatct ggaagataac catcggcagc catacctggt ttaagtcatt 2280 

tattgctcag aaacacagtc atccaggtcc acgttgacca gatcgcaggc cgagcaagca 2340 

atctcgggag cccgccccag cagatgatga atggcacaga gtttccgata cgtcctcttt 2400 

ctgacgaccg gttgagattc tgacacgccg gggaaacatt ctgaacagtc tctggtcccg 2460 

tgcgtgaagc aaatgttgaa attctgattc attctctcgc atgtcttgca gggaaacagc 2520 

atctgaagca tgcccgcgtg acgagaacat ttgttttggt acctgtcggc aaagtccacc 2580 

ggagctcctt ccgcgtctga cgtcgatgga tccgcgactg aggggcaagc ccgcttgggc 2640 

tcgcttttat ccgcgtcatc gggggcgggt ctcttgttgg ctccaccctt tctgacgtag 2700 

aactcatgcg ccacctcggt cacgtgatcc tgcgcccagc ggaagaactc tttgacttcc 2760 

tgctttgtca ccttgccaaa gtcatgctcc agacggcggg tgagttcaaa tttgaacatc 2820 

cggtcttgta acggctgctg gtgctcgaag gtggtgctgt tcccgtcaat cacggcgcac 2880 

atgttggtgt tggaagtgac gatcacgggg gtgggatcga tctgggcgga cgacttgcac 2940 

ttttggtcca cgcgcacctt gctgccgccg agaatggcct tggcggactc cacgaccttg 3000 

gccgtcatct tgccctcctc ccaccagatc accatcttgt cgacgcaatc gttgaaggga 3060 

aagttctcat tggtccagtt gacgcagccg aagggcgaat tc 3102 

<210> 18 
• <211> 3106 
<212> DNA 

<213> new AAV serotype, clone 7-2 
<400> 18 

gcggccgcga attcgccctt cgcagagacc aaagttcaac tgaaacgaat cagccggttt 60 

attgattaac aagtaattac aggttacggg tgaggtaacg ggtgccaatg gggcgaggct 120 

cagtataaac cccttcgttg ttgacagcaa attccacatt attagacttg gcataatttg 180 
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aggtgtactg aatctctgga ttccagcgtt tgctgttttc tttctgcagt tcccactcga 240 

tctccacgct gacctggccg gtgctgtact gcgtgataaa tgaggcaaac ttggcaggag 300 

taaacacctc tggaggatta gcaggtaccg gggtgttttt gatgagaatt tgaggaggcg 360 

ggtgtttgag tccaaatccg cccatcaggg gagacgggtg aaagttgccg tccgtgtgag 420 

gaattttggc ccagatggga ccctgcaggt acacgtcccg gttctgccag accatgccgg 480 

gcagagcccc ctggctgttg acagtctgtg tctggggtcc ggccgtagac gattgcaggt 540 

tgctggagac cacaccgtat tcttctgtag ccacgggatt ggtggttttg atctcctcct 600 

cgctggtcat tagcacgttt tccagcgttg tcttgttggc agcccccgtt ttgccaaaaa 660 

ccagcactcc gttgatggga aagaactggt cctcgtcgtc cttgttggtg gccatggcta 720 

cgcccgggtt ggttaatgaa tttctaccat tcagatggta tttagtggcc ccggtccagg 780 

caaagttact gttgttgttg ctgtctatgt tttttgacag tctctgctgc cgataacagg 840 

gtccgggcag ccagttcttt gattgctcgg ccatggtgtt gggcccagcc tgatggaact 900 

gcagctccct tgtggacccc gtagtgctct gggtccgggc caggtagtac aggtactggt 960 

cgatgagggg attcatcagc cggtccaggc tctggctgtg cgcatagctg ctgtggaaag 1020 

gcacttcctc aaaggtgtag ctgaattcaa agttatcgcc cgttctcagc atctgagaag 1080 

gaaagtactc caggcagtag aaggaggaac gtcccacaga ctgactgccg ttgtttagag 1140 

tcagatatcc gtactgagga atcatgaaca cgtccgcagg gaacggaggg aggcagccct 1200 

ggtgcgcaga gccgaggacg tacggcagtt ggtactccga gtccgagaag acctgaatcg 1260 

tgctggtaag gttattagcg atggtcgtaa cgccgtcgtt cgtcgtgacc tccttgacct 1320 

ggatgttgaa caacttgaac cgcagctttc tgggccggaa tccccagttg ttgttgatga 1380 

gtcgctgcca gtcacgtggt gagaagtggc agtggaatct gttgaagtca aaatagcccc 1440 

agggggtgct gtagccgaag aagtggttgt cgttggtagc cccgctctga cttgatatct 1500 

gcttgtagag gtggttgttg taggtgggca gggcccaggt gcgggtgctg gtggtgatga 1560 

ctctgtcgcc cagccatgtg gaatcgcaat gccaatttcc ggaggcatta cccactccgt 1620 

cggcgccttc gttattgtct gccattggtg cgccaccgcc tgcagccatt gtaccagatc 1680 

ccagacctga gggcgcggcg ggaggttctc cgagaggttg ggggtcgggc actgactctg 1740 

agtcgccagt ctgcccaaag ttgagcttct ttttagcggg cggctggccg ttcttgccga 1800 

tgcccgtgga ggagtcgggg gattctatgg gtctcttctt tccaggagcc gtcttagcgc 1860 

cttcctcaac cagaccgaga ggtfcgagaa cccgcttctt ggcctggaag actgctcgcc 1920 

cgaggttgcc cccaaaagac gtatcttctt gaagacgctc ctgaaactcg gcgtcggcgt 1980 

ggttgtactt gaggtacggg ttgtccccct gctcgagctg cttgtcgtag gccttgtcgt 2040 

gctcgagggc cgcggcgtct gcctcgttga ccggctctcc cttgtcgagt ccgttgaagg 2100 
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gtccgaggta 


cctgt ageca 


ggaagcacca 


gaccccggcc 


gtcgtcctgc 


ttttgctggt 


2160 


tggctttggg 


tttegggget 


ccaggtttca 


agtcccacca 


etcgegaatg 


ccctcagaga 


2220 


ggttgccctc 


gagecaatet 


ggaagataac 


catcggcagc 


catacctggt 


ttaagtcatt 


2280 


t attgctcag 


aaacacagtc 


atccaggtcc 


acgttggcca 


gategcagge 


cgagcaagca 


2340 


atctcgggag 


cccgccccag 


cagat gatga 


atggcacaga 


gtttccgata 


cgtcctcttt 


2400 


ctgacgaccg 


gttgagattc 


t gacacgccg 


gggaaacatt 


ctgaacagtc 


tctggtcccg 


2460 


tgcgtgaagc 


aaat gttgaa 


att ctgattc 


arte tetege 


atgtcttgca 


ggggaacagc 


2520 


atctgaagca 


tgcccgcgtg 


acgagaacat 


ttgttttggt 


acctgtcggc 


aaagtccacc 


2580 


ggagctcctt 


ccgcgtctga 


cgtcgatgga 


tccgcgactg 


aggggcaggc 


ccgcttgggc 


2640 


tcgcttttat 


ccgcgtcatc 


gggggcgggt 


ctcttgttgg 


ctccaccctt 


tetgaegtag 


2700 


aactcatacg 


ccacctcggt 


cacgtgatcc 


tgcgcccagc 


ggaagaactc 


tttgacttcc 


2760 


t gctt t g tea 


ccttgccaaa 


gtcatgctcc 


agaeggeggg 


tgagttcaaa 


tttgaacatc 


2820 


cygtCutyta 


ac ggctgctg 


gtget cgaag 


gtggtgctgt 


tcccgtcaat 


cacggcgcac 


2880 


dLy utyy ty t 


Lggaagugac 


gat caegggg 


gtgggatcga 


tctgggcgga 


cgacttgcac 


2940 


ttttggtcca 


cgcgcacctt 


gctgccgccg 


agaatggect 


tggeggaetc 


cacgaccttg 


3000 


gccgtcatcc 


tgccctcctc 


ccaccagatc 


accatcttgt 


cgacgcaatc 


gttgaaggga 


3060 


aagttctcat 


tggtccagtt 


gacgcagccg 


tagaaagggc 


gaattc 




3106 



<210> 19 

<211> 3105 

<212> DNA 

<213> new AAV serotype, clone CI 

<400> 19 



gaattcgccc 


ttgctgcgtc 


aactggacca 


atgagaactt 


tcccttcaac 


gattgegteg 


60 


acaagatggt 


gatctggtgg 


gaggagggca 


agatgacege 


caaggtegtg 


gagtccgcca 


120 


aggecattet 


gggeggaage 


aaggtgcgcg tggaccaaaa 


gtgeaagtea 


tcggcccaga 


180 


tcgaccccac 


gcccgtgatc 


gtcacctcca 


acaccaacat 


gtgcgccgtg 


ategaeggga 


240 


acagcaccac 


cttcgagcac 


cagcagccgc 


tgcaggaccg 


catgttcaag 


ttcgagctca 


300 


cccgccgtct 


ggagcacgac 


tttggcaagg 


tgaccaagca 


ggaagtcaaa 


gagttcttcc 


360 


gctgggctca 


ggatcacgtg 


actgaggtgg 


cgcatgagtt 


etaegtcaga 


aagggeggag 


420 


ccaccaaaag 


acccgccccc 


agtgacgcgg 


atataagega 


gcccaagcgg 


gcctgcccct 


480 


cagttgcgga 


gccatcgacg 


teagacgegg 


aagcaccggt 


ggactttgcg 


gacaggtacc 


540 


aaaacaaatg 


ttctcgtcac 


gcgggcatgc 


ttcagatget 


gtttccctgc 


aagacatgcg 


600 


agagaatgaa 


tcagaatttc 


aaegtctget 


tcacgcacgg 


ggtcagagac 


tgctcagagt 


660 


gcttccccgg 


cgcgtcagaa 


tctcaacccg tegtcagaaa 


aaagaegtat 


cagaaactgt 


720 
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gcgcgattca 


tcatctgctg 


gggcgggcac 


ccgagattgc 


gtgttcggcc 


cgcgatctcg 


780 


tcaacgtgga 


cttggatgac 


tgtgtttctg 


agcaataaat 


gacttaaacc 


aggtatggct 


840 


gctgacggtt 


atcttccaga 


ttggctcgag 


gacaacctct 


ctgagggcat 


tcgcgagtgg 


900 


tgggacctga 


aacctggagc 


ccccaagccc 


aaggccaacc 


agcagaagca 


ggacgacggc 


960 


cggggtctgg 


tgcttcctgg 


ctacaagtac 


ctcggaccct 


tcaacggact 


cgacaagggg 


1020 


gagcccgtca 


acgcggcgga 


cgcagcggcc 


ctcgagcacg 


acaaggccta 


cgaccagcag 


1080 


ctcaaagcgg 


gtgacaatcc 


gtacctgcgg 


tataaccacg 


ccgacgccga 


gtttcaggag 


1140 


cgtctgcaag 


aagatacgtc 


ttttgggggc 


aacctcgggc 


gagcagtctt 


ccaggccaag 


1200 


aagagggtac 


tcgaacctct 


gggcctggtt 


gaagaaggtg 


ctaagacggc 


tcctggaaag 


1260 


aagagaccgt 

t. 


tagagtcacc 


acaagagccc 


gactcctcct 


caggaatcgg 


caaaaaaggc 


1320 


aaacaaccag 


ccaaaaagag 


actcaacttt 


gaagaggaca 


ctggagccgg 


agacggaccc 


1380 


cctgaaggat 


cagataccag 


cgccatgtct 


tcagacattg 


aaatgcgtgc 


agcaccgggc 


1440 


ggaaatgctg 


tcgatgcggg 


acaaggttcc 


gatggagtgg 


gtaatgcctc 


gggtgattgg 


1500 


cattgcgatt 


ccacctggtc 


tgagggcaag 


gtcacaacaa cctcgaccag 


aacctgggtc 


1560 


ttgcccacct 


acaacaacca 


cttgtacctg 


cggctcggaa 


caacatcaaa 


cagcaacacc 


1620 


tacaacggat 


tctccacccc 


ctggggatac 


tttgacttta 


acagattcca 


ctgtcacttc 


1680 


tcaccacgtg 


actggcaaag 


actcatcaac 


aacaactggg 


gactacgacc 


aaaagccatg 


1740 


cgcgttaaaa 


tcttcaatat 


ccaagttaag 


gaggtcacaa 


cgtcgaacgg 


cgagactacg 


1800 


gtcgctaata 


accttaccag 


cacggttcag 


atatttgcgg 


actcgtcgta 


tgagctcccg 


1860 


tacgtgatgg 


acgctggaca 


agagggaagt 


ctgtctcctt 


tccccaatga 


cgtcttcatg 


1920 


gtgcctcaat 


atggctactg 


tggcattgtg 


actggcgaaa 


atcagaacca 


gacggacaga 


1980 


aatgctttct 


actgcctgga 


gtattttcct 


tcacaaatgc 


tgagaactgg caataacttt 


2040 


gaaatggctt 


acaactttgg 


gaaggtgccg 


ttccactcaa 


tgtatgctta 


cagccagagc 


2100 


ccggacagac 


tgatgaatcc 


cctcctggac 


cagtacctgt 


ggcacttaca 


gtcgaccacc 


2160 


tctggagaga 


ctctgaatca 


aggcaatgca 


gcaaccacat 


ttggaaaaat 


caggagtgga 


2220 


gactttgcct 


tttacagaaa 


gaactggctg 


cctgggcctt 


gtgttaaaca 


gcagagactc 


2280 


tcaaaaactg 


ccagtcaaaa 


ttacaagatt 


cctgccagcg 


ggggcaacgc 


tctgttaaag 


2340 


tatgacaccc 


actatacctt 


aaacaaccgc 


tggagcaaca 


tagcgcctgg 


acctccaatg 


2400 


gcaacagctg 


gaccttcaga 


tggggacttc 


agcaacgccc 


agctcatctt 


ccctggacca 


2460 


tcagtcaccg 


gaaacacaac 


aacctcagca 


aacaatctgt 


tgtttacatc 


agaagaagaa 


2520 


attgctgcca 


ccaacccaag 


agacacggac 


atgtttggtc 


agattgctga 


caataatcag 


2580 


aatgctacaa 


ctgctcccat 


aaccggcaac 


gtgactgcta 


tgggagtgct 


tcctggcatg 


2640 
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gtgtggcaaa acagagacat ttactaccaa gggccaattt gggccaagat cccacacgcg 2700 

gacggacatt ttcatccttc accgctaatt ggcggttttg gactgaaaca tccgcctccc 2760 

cagatattta tcaaaaacac ccccgtacct gccaatcctg cgacaacctt cactgcagcc 2820 

agagtggact ctttcatcac acaatacagc accggccagg tcgctgttca gattgaatgg 2880 

gaaatcgaaa aggaacgctc caaacgctgg aatcctgaag tgcagtttac ttcaaactat 2940 

gggaaccagt cttctatgtt gtgggctccc gatacaactg ggaagtatac agagccgcgg .3000 

gttattggct ctcgttattt gactaatcat ttgtaactgc ctagttaatc aataaaccgt 3060 

gtgattcgtt tcagttgaac tttggtctct gcgaagggcg aattc 3105 

<210> 20 
<211> 3105 
<212> DNA 

<213> new AAV serotype, clone C3 
<400> 20 

gaattcgccc ttgctgcgtc aactggacca atgagaactt tcccttcaac gattgcgtcg 60 

acaagatggt gatctggtgg gaggagggca agatgaccgc caaggtcgtg gagtccgcca 120 

aggccattct gggcggaagc aaggtgcgcg tggaccaaaa gtgcaagtca tcggcccaga 180 

tcgaccccac gcccgtgatc gtcacctcca acaccaacat gtgcgccgtg atcgacggga 240 

acagcaccac cttcgagcac cagcagccgc tgcaggaccg catgttcaag ttcgagctca 300 

cccgccgtct ggagcacgac tttggcaagg tgaccaagca ggaagtcaaa gagttcttcc 360 

gctgggctca ggatcacgtg actgaggtgg cgcatgagtt ctacgtcaga aagggcggag 420 

ccaccaaaag acccgccccc agtgacgcgg atataagcga gcccaagcgg gcctgcccct 480 

cagttgcgga gccatcgacg tcagacgcgg aagcaccggt ggactttgcg gacaggtacc 540 

aaaacaaatg ttctcgtcac gcgggcatgc ttcagatgct gtttccctgc aagacatgcg 600 

agagaatgaa tcagaatttc aacgtctgct tcacgcacgg ggtcagagac tgctcagagt 660 

gcttccccgg cgcgtcagaa tctcaacccg tcgtcagaaa aaagacgtat cagaaactgt 720 

gcgcgattca tcatctgctg gggcgggcac ccgagattgc gtgttcggcc tgcgatctcg 780 

tcaacgtgga cttggatgac tgtgtttctg agcaataaat gacttaaacc aggtatggct 84 0 

gctgacggtt atcttccaga ttggctcgag gacaacctct ctgagggcat tcgcgagtgg 900 

tgggacctga aacctggagc ccccaagctc aaggccaacc agcagaagca ggacgacggc 9 60 

cggggtctgg tgcttcctgg ctacaagtac ctcggaccct tccacggact cgacaagggg 1020 

gagcccgtca acgcggcgga cgcagcggcc ctcgagcacg acaaggccta cgaccagcag 1080 

ctcaaagcgg gtgacaatcc gtacctgcgg tataaccacg ccgacgccga gtttcaggag 1140 

cgtctgcaag aagatacgtc ttttgggggc aacctcgggc gagcagtctt ccaggccaag 1200 
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aagagggtac tcgaaccact gggcctggtt gaagaaggtg ctaagacggc tcctggaaag 1260 

aagagaccgt tagagtcacc acaagagccc gactcctcct caggaatcgg caaaaaaggc 1320 

aaacaaccag ccaaaaagag actcaacttt gaagaggaca ctggagccgg agacggaccc 1380 

cctgaaggat cagataccag cgccatgtct tcagacattg aaatgcgtgc agcaccgggc 1440 

ggaaatgctg tcgatgcggg acaaggttcc gatggagtgg gtaatgcctc gggtgattgg 1500 

cattgcgatt ccacctggtc tgagggcaag gtcacaacaa cctcgaccag aacctgggtc 1560 

ttgcccacct acaacaacca cttgtacctg cggctcggaa caacatcaaa cagcaacacc 1620 

tacaacggat tctccacccc ctggggatac tttgacttta acagattcca ctgtcacttc 1680 

tcaccacgtg actggcaaag actcatcaac aacaactggg gactacgacc aaaagccatg 1740 

cgcgttaaaa tcttcaatat ccaagttaag gaggtcacaa cgtcgaacgg cgagactacg 1800 

gtcgctaata accttaccag cacggttcag atatttgcgg actcgtcgta tgagctcccg 1860 

tacgtgatgg acgctggaca agagggaagt ctgcctcctt tccccaatga cgtcttcatg 1920 

gtgcctcaat atggctactg tggcattgtg actggcgaaa atcagaacca gacggacaga 1980 

aatgctttct actgcctgga gtattttcct tcacaaatgc tgagaactgg caataacttt 2040 
gaaatggctt acaactttga gaaggtgccg ttccactcaa tgtatgctca cagccagagc 2100 
ctggacagac tgatgaatcc cctcctggac cagtacctgt ggcacttaca gtcgaccacc 2160 
tctggagaga ctctgaatca aggcaatgca gcaaccacat ttggaaaaat caggagtgga 2220 
gactttgcct tttacagaaa gaactggctg cctgggcctt gtgttaaaca gcagagattc 2280 
tcaaaaactg ccagtcaaaa ttacaagatt cctgccagcg ggggcaacgc tctgttaaag 2340 
tatgacaccc actatacctt aaacaaccgc tggagcaaca tagcgcctgg acctccaatg 2400 
gcaacagctg gaccttcaga tggggacttc agcaacgccc agctcatctt ccctggacca 2460 
tcagtcaccg gaaacacaac aacctcagca aacaatctgt tgtttacatc agaaggagaa 2 52 0 
attgctgcca ccaacccaag agacacggac atgtttggtc agattgctga caataatcag 2580 
aatgctacaa ctgctcccat aaccggcaac gtgactgcta tgggagtgct tcctggcatg 2640 
gtgtggcaaa acagagacat ttactaccaa gggccaattt gggccaagat cccacacgcg 2700 
gacggacatt ttcatccttc accgctaatt ggcggttttg gactgaaaca tccgcctccc 2760 
cagatattta tcaaaaacac ccccgtacct gccaatcctg cgacaacctt cactgcagcc 2820 
agagtggact ctttcatcac acaatacagc accggccagg tcgctgttca gattgaatgg 2880 
gaaatcgaaa aggaacgctc caaacgccgg aatcctgaag tgcagtttac ttcaaactat 2940 
gggaaccagt cttctatgtt gtgggctccc gatacaactg ggaagtatac agagccgcgg 3000 
gttattggct ctcgttattt gactaatcat ttgtaactgc ctagttaatc aataaaccgt 3060 
gtgattcgtt tcagttgaac tttggtctct gcgaagggcg aattc 3105 
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<210> 21 
<211> 3105 
<212> DNA 

<213> new AAV serotype, clone C5 

gaattcgccc ttcgcagaga ccaaagttca actgaaacga atcacacggt ttattgatta 60 

actaggcagt tacaaatgat tagtcaaata acgagagcca ataacccgcg gctctgtata 120 

cttcccagtt gtatcgggag cccacaacat agaagactgg ttcccacagt ttgaagtaaa 180 

ctgcacttca ggattccagc gtttggagcg ttccttttcg atttcccatt caatctgaac 240 

agcgacctgg ccggtgctgt attgtgtgat gaaagagtcc actctggctg cagtgaaggt 300 

tgtcgcagga taggcaggta cgggggtgtt tttgataaat atctggggag gcggatgttt 360 

cagtccaaaa ccgccaatta gcggtgaagg atgaaaatgt ccgtccgcgt gtgggatctt 420 

ggcccaaatt ggcccttggt agtaaatgtc tctgttttgc cacaccatgc caggaagcac 480 

tcccatagca gtcacgttgc cggttatggg agcagttgta gcattctgat tattgtcagc 540 

aatctgacca aacatgtccg tgtctcttgg gttggtggca gcaatttctt cttctgatgt 600 

aaacaacaga ttgtttgctg aggttgttgt gtttccggtg actgatggtc cagggaagat 660 

gagctgggcg ttgctgaagt ccccatctga aggtccagct gttgccattg gaggtccagg 720 

cgctatgttg ctccagcggt tgtttaaggt atagtgggtg tcatacttta acagagcgtt 780 
gcccccgctg gcaggaatct tgtaattttg actggcagtt tttgagaatc tctgctgttt 
aacacaaggc ccaggcagcc agttctttct gtaaaaggca aagtctccac tcctgatttt 
tccaaatgtg gttgctgcat tgccttgatt cagagtctct ccagaggtgg tcgactgtaa 

gtgccacagg tactggtcca ggaggggatt catcagtccg tccaggctct ggctgtgagc 1020 

atacattgag tggaacggca ccttctcaaa gttgtaagcc gtttcaaagt tattgccagt 1080 

tctcagcatt tgtgaaggaa aatactccag gcagtagaaa gcatttctgt ccgtctggtt 1140 

ctgattttcg ccagtcacaa tgccacagta gccatattga ggcaccatga agacgtcatt 1200 

ggggaaagga ggcagacttc cctcttgtcc agcgtccatc acgtacggga gctcatacga 1260 

cgagtccgca aatatctgaa ccgtgctggt aaggttatta gcgaccgtag tctcgccgtt 1320 

cgacgttgtg acctccttaa cttggatatt gaagatttta acgcgcatgg cttttggtcg 1380 

tagtccccag ttgttgttga tgagtctttg ccagtcacgt ggtgagaagt gacagtggaa 1440 

tctgttaaag tcaaagtatc cccagggggt ggagaatccg ttgtaggtgt tgctgtttga 1500 

tgttgttccg agccgcaggt acaagtggtt gttgtaggtg ggcaagaccc aggttctggt 1560 

cgaggttgtt gtgaccttgc cctcagacca ggtggaatcg caatgccaat cacccgaggc 1620 

attacccact ccatcggaac cttgtcccgc atcgacagca tttccgcccg gtgctgcacg 1680 

catttcaatg tctgaagaca tggcgctggt atctgatcct tcagggggtc cgtctccggc 1740 



840 
900 
960 
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tccagtgtcc 
tcctgaggag 
cttagcacct 
tgctcgcccg 
gtcggcgtgg 
cttgtcgtgc 
gttgaagggt 
ctgctggttg 
ctcagagagg 
aagtcattta 
aacacgcaat 
tcttttttct 
tgaccccgtg 
gaaacagcat 
ggtccaccgg 
tgggctcgct 
cgtagaactc 
cttcctgctt 
acatgcggtc 
cgcacatgtt 
tgcacttttg 
ccttggcggt 
agggaaagtt 



tcttcaaagt tgagtctctt 
gagtcgggct cttgtggtga 
tcttcaacca ggcccagagg 
aggttgcccc caaaagacgt 
ttataccgca ggtacggatt 
tcgagggccg ctgcgtccgc 
ccgaggtact cgtagccagg 
gccttgggct tgggggctcc 
ttgtcctcga gccaatctgg 
ttgctcagaa acacagtcat 
ctcgggtgcc cgccccagca 
gacgacgggt tgagattctg 
cgtgaagcag acgttgaaat 
ctgaagcatg cccgcgtgac 
tgcttccgcg tctgacgtcg 
tatatccgcg tcactggggg 
atgcgccacc tcagtcacgt 
ggtcaccttg ccaaagtcgt 
ctgcagcggc tgctggtgct 
ggtgttggag gtgacgatca 
gtccacgcgc accttgcttc 
catcttgccc tcctcccacc 
ctcattggtc cagttgacgc 



tttggctggt 
ctctaacggt 
ttcgagtacc 
atcttcttgc 
gtcacccgct 
cgcgttgacg 
aagcaccaga 
aggtttcagg 
aagataaccg 
ccaagtccac 
gatgatgaat 
acgcgccggg 
tctgattcat 
gagaacattt 
atggctccgc 
cgggtctttt 
gatcctgagc 
gctccagacg 
cgaaggtggt 
cgggcgtggg 
cgcccagaat 
agatcaccat 
agcaagggcg 



tgtttgcctt ttttgccgat 1800 

ctcttctttc caggagccgt " 1860 

ctcttcttgg cctggaagac 1920 

agacgctcct gaaactcggc 1980 

ttgagctgct ggtcgtaggc 2040 

ggctccccct tgtcgagtcc 2100 

ccccggccgt cgtcctgctt 2160 

tcccaccact cgcgaatgcc 2220 

tcagcagcca tacctggttt 2280 

gttgacgaga tcgcaggccg 2340 

cgcgcacagt ttctgatacg 2400 

gaagcactct gagcagtctc 2460 

tctctcgcat gtcttgcagg 2520 

gttttggtac ctgtccgcaa 2580 

aactgagggg caggcccgct 2640 

ggtggctccg ccctttctga 2700 

ccagcggaag aactctttga 2760 

gcgggtgagc tcgaacttga 2820 
gctgttcccg tcgatcacgg 2880 
gtcgatctgg gccgatgact 2940 
ggccttggcg gactccacga 3000 
cttgtcgacg caatcgttga 3060 
aattc 3105 



<210> 22 
<211> 3094 
<212> DNA 

<213> new AAV serotype, clone Fl 
<400> 22 

gaattcgccc ttgctgcgtc aactggacca agagaacttt cccttcaacg attgcgtcga 
caagatggtg atctggtggg aggagggcaa gatgacggcc aaggtcgtgg agtccgccaa 
agccattctg ggcggaagca aggtgcgcgt cgaccaaaag tgcaagtcct cggcccagat 
cgatcccacc cccgtgatcg tcacctccaa caccaacatg tgcgccgtga tcgacgggaa 
cagcaccacc ttcgagcacc agcagccgtt gcaggaccgg atgttcaaat ttgaactcac 



60 
120 
180 
240 
300 
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ccgccgtctg gaacacgact ttggcaaggt gaccaagcag gaagtcaaag agttcttccg 3 60 

ctgggctagt gatcacgtga ctgaggtgac gcatgagttc tacgtcagaa agggcggagc 420 

cagcaaaaga cccgcccccg atgacgcgga tataagcgag cccaagcggg cctgtccctc 480 

agtcacggac ccatcgacgt cagacgcgga aggagctccg gtggactttg ccgacaggta 540 

ccaaaacaaa tgttctcgtc acgcgggcat gcttcagatg ctgtttccct gcaaaacgtg 600 

cgagagaatg aatcagaatt tcaacatttg cttcacgcac ggggtcagag actgtttaga 660 

atgtttcccc ggcgtgtcag aatctcaacc ggtcgtcaga aaaaagacgt atcggaagct 720 

gtgtgcgatt catcatctgc tggggcgggc acccgagatt gcttgctcgg cctgcgacct 780 

ggtcaacgtg gacctggacg actgtgtttc tgagcaataa atgacttaaa ccgggtatgg 840 

ctgccgatgg ttatcttcca gattggctcg aggacaacct ctctgagggc attcgcgagt 900 

ggtgggacct gaaacctgga gccccgaaac ccaaagccaa ccagcaaaag caggacgacg 960 

gccggggtct ggtgcttcct ggctacaagt acctcggacc cttcaacgga ctcgacaagg 1020 

gggagcccgt caacgcggcg gacgcagcgg ccctcgagca cgacaaggcc tacgaccagc 1080 

agctcaaagc gggtgacaat ccgtacctgc ggtataacca cgccgacgcc gagtttcagg 1140 

agcgtctgca agaagatacg tcatttgggg gcaacctcgg gcgagcagtc ttccaggcca 1200 

agaagcgggt tctcgaacct ctcggtctgg ttgaggaagg cgctaagacg gctcctggaa 1260 

agaagagacc catagactct ccagactcct ccacgggcat cggcaaaaaa ggccagcagc 1320 

ccgctaaaaa gaagctcaat tttggtcaga ctggcgactc agagtcagtc cccgaccctc 1380 

aacctcttgg agaacctcca gcagcgccct ctagtgtggg atctggtaca atggctgcag 1440 

gcggtggcgc accaatggca gacaataacg aaggtgccga cggagtgggt aatgcctcag 1500 

gaaattggca ttgcgattcc acatggctgg gcgacagagt catcaccacc agcaccagaa 1560 

cctgggccct ccccacctac aacaaccacc tctacaagca aatctccagc agcagctcag 1620 

gagccaccaa tgacaaccac tacttcggct acagcacccc ctgggggtat tttgacttta 1680 

acagattcca ctgccacttc tcaccacgtg actggcagcg actcatcaac aacaactggg 1740 

gattccggcc caagaagctg cggttcaagc tcttcaacat ccaggtcaag gaggtcacaa 1800 

cgaatgacgg cgtcacgacc atcgctaata accttaccag cacggttcag gtcttctcgg 18 60 

actcggaata ccagctgccg tacgtcctcg gctctgcgca ccagggctgc ctgcctccgt 1920 

tcccggcgga cgtcttcatg attcctcagt acggctacct gactctgaac aacggcagcc 1980 

aatcggtggg ccgttcctcc ttctactgcc tggaatattt cccctctcaa atgctgagaa 2040 

cgggcaacaa ctttgagttc agttacagct tcgaggacgt gcctttccac agcagctacg 2100 

cgcacagcca gagcctagac cggctgatga accctctcat cgaccagtac ctgtactacc 2160 

tggcccggac ccagagcacc acgggttcca ccagggaact gcaatttcat caagctgggc 2220 
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ccaatactat ggccgagcag tcaaagaact ggctgcctgg accctgctat aggcaacagg 2280 

gactgtcaaa gaacttggac tttaacaaca acagcaattt tgcctggact gctgccacta 2340 

aatatcatct gaatggcaga aactctttga ccaatcctgg cattcccatg gcaaccaaca 2400 

aggatgatga ggaccagttc tttcccatca acggggtact ggtttttggc aagacgggag 2460 

ctgccaacaa aactacgctg gaaaacgttc tgatgaccag cgaggaggag atcaagacca 2520 

ctaaccctgt ggctacagaa gaatacggtg tggtctccag caacctgcag ccgtctacag 2580 

ccgggcctca atcacagact atcaacagcc agggagcact gcctggcatg gtctggcaga 2 640 

accgggacgt gtatctgcag ggtcccatct gggccaaaat tcctcacacg gatggcaact 2700 

ttcacccgtc tcctctgatg ggcggttttg gactcaaaca cccgcctcca cagatcctga 27 60 

tcaaaaacac acctgtacct gctaatcctc cggaggtgtt tactcctgcc aagtttgcct 2820 

ccttcatcac gcagtacagc accggacaag tcagcgtgga aatcgagtgg gagctgcaga 28 80 

aagaaaacag caagcgctgg aacccagaaa ttcagtatac ttccaattat gccaagtcta 2940 

ataatgttga atttgctgtg aaccctgatg gtgtttatac tgagcctcgc cccattggca 3000 

ctcgttacct cccccgtaat ctgtaattgc ttgttaatca ataaaccggt tgattcgttt 3060 

cagttgaact ttggtctctg cgaagggcga attc 3094 



<210> 23 
<211> 3095 
<212> DNA 

<213> new AAV serotype, clone F3 
<400> 23 

gaattcgccc ttcgcagaga ccaaagttca actgaaacga atcaaccggt ttattgatta 60 
acaagcaatt acagattacg ggtgaggtaa cgagtgccaa tggggcgagg ctcagtataa 120 
acaccatcag ggttcacagc aaattcaaca ttattagact tggcataatt ggaagtatac 180 
tgaatttctg ggttccagcg cttgctgttt tctttctgca gctcccactc gatttccacg 
ctgacttgtc cggtgctgta ctgcgtgatg aaggaggcaa acttggcagg agtaaacacc 
tccggaggat tagcaggtac aggtgtgttt ttgatcagga tctgtggagg cgggtgtttg 
agtccaaaac cgcccatcag aggagacggg tgaaagttgc catccgtgtg aggaattttg 420 
gcccagatgg gaccctgcag atacacgtcc cggttctgcc agaccatgcc aggcagtgct 480 
ccctggctgt tgatagtctg tgattgaggc ccggctgtag acgactgcag gttgctggag 540 
accacaccgt attcttctgt agccacaggg ttagtggtct tgatctcctc ctcgctggtc 
atcagaacgt tttccagcgt agttttgttg gcagctcccg tcttgccaaa aaccagtacc 
ccgttgatgg gaaagaactg gtcctcatca tccttgttgg ttgccatggg aatgccagga 
ttggtcaaag agtttctgcc attcagatga tatttagtgg cagcagtcca ggcaaaattg 



240 
300 
360 



600 
660 
720 
780 
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ctgttgttgt taaagtccaa gttctttgac agtctctgtt gcctatagca gggtccaggc 840 
agccagttct ttgactgctc ggccatagta ttgggcccag cttgatgaaa ttgcagttcc 900 
ctggtggaac ccgtggtgct ctgggtccgg gccaggtagt acaggtactg gtcgatgaga 960 

gggttcatca gccggtctag gctctggctg tgcgcgtagc tgctgtggaa aggcacgtcc 1020 

tcgaagctgt aactgaactc aaagttgttg cccgttctca gcatttgaga ggggaaatat 1080 

tccaggcagt agaaggagga acggcccacc gattggctgc cgttgtccag agtcaggtag 1140 

ccgtactgag gaatcatgaa gacgtccgcc gggaacggag gcaggcagcc ctggtgcgca 1200 

gagccgagga cgtacggcag ctggtattcc gagtccgaga agacctgaac cgtgctggta 1260 

aggttattag cgatggtcgt gacgccgtca ttcgttgtga cctccttgac ctggatgttg 1320 

aggagcttga accgcagctt cttgggccgg aatccccagt tgttgttgat gagtcgctgc 1380 

cagtcacgtg gtgagaagtg gcagtggaat ctgttaaagt caaaataccc ccagggggtg 1440 

ctgtagccga agtagtggtt gtcattggtg gctcctgagc tgctgctgga gatttgcttg 1500 

tagaggtggt tgttgtaggt ggggagggcc caggttctgg tgctggtggt gatgactctg 1560 

tcgcccagcc atgtggaatc gcaatgccaa tttcctgagg cattacccac tccgtcggca 162 0 

ccttcgttat tgtctgccat tggtgcgcca ccgcctgcag ccattgtacc agatcccaca 1680 

ctagagggcg ctgctggagg ttctccaaga ggttgagggt cggggactga ctctgagtcg 1740 

ccagtctgac caaaattgag cttcttttta gcgggctgct ggcctttttt gccgatgccc 1800 

gtggaggagt ctggagagcc tatgggtctc ttctttccag gagccgtctt agcgccttcc 1860 

tcaaccagac cgagaggttc gagaacccgc ttcttggcct ggaagactgc tcgcccgagg 1920 

ttgcccccaa atgacgtatc ttcttgcaga cgctcctgaa actcggcgtc ggcgtggtta 1980 

taccgcaggt acggattgtc acccgctttg agctgctggt cgtaggcctt gtcgtgctcg 2040 

agggccgctg cgtccgccgc gttgacgggc tcccccttgt cgagtccgtt gaagggtccg 2100 

aggtacttgt agccaggaag caccagaccc cggccgtcgt cctgcttttg ctggttggct 2160 
ttgggtttcg gggctccagg tttcaggtcc caccactcgc gaatgccctc agagaggttg 2220 
tcctcgagcc aatctggaag ataaccatcg gcagccatac ctggtttaag tcatttattg 2280 
ctcagaaaca cagtcgtcca ggtccacgtt gaccaggtcg caggccgagc aagcaatctc 2340 
gggtgcccgc cccagcagat gatgaatcgc acacagcttc cgatacgtct tttttctgac 2400 
gaccggttga gattctgaca cgccggggaa acattctaaa cagtctctga ccccgtgcgt 2460 
gaagcaaatg ttgaaattct gattcattct ctcgcacgtt ttgcagggaa acagcacctg 2520 
aagcatgccc gcgtgacgag aacatttgtt ttggtacctg tcggcaaagt ccaccggagc 2580 
tccttccgcg tctgacgtcg atgggtccgt gactgaggga cgggcccgct tgggctcgct 2640 
tatatccgcg tcatcggggg cgggtctttt gctggctccg ccctttctga cgtagaactc 2700 
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atgcgtcacc tcagtcacgt gatcactagc ccagcggaag aactctttga cttcctgctt 2760 

tgtcaccttg ccaaagtcgt gttccagacg gcgggtgagt tcaaatttga acatccggtc 2820 

ctgcaacggt tgctggtgct cgaaggtggt gctgttcccg tcgatcacgg cgcacatgtt 2880 

ggtgttggag gtgacgatca cgggggtggg atcgatctgg gcggacgact tgcacttttg 2 940 
gtccacgcgc accttgctgc cgccgagaat ggccttggcg gactccacga ccttggccgt 
catcttgccc tcctcccacc agatcaccat cttgtcgacg caatcgttga agggaaagtt 

ctcattggtc cagttgacgc agcaagggcg aattc 3095 

<210> 24 

<211> 3095 

<212> DNA 

<213> new AAV serotype, clone F5 

<400> 24 



3000 
3060 



gaattcgccc 


ttcgcagaga 


ccaaagttca 


actgaaacga 


atcaaccggt 


ttattgatta 


60 


acaagcaatt 


acagattacg 


ggtgaggtaa 


cgagtgccaa 


tggggcgagg 


ctcagtataa 


120 


acaccatcag 


ggttcacagc 


aaattcaaca 


ttattagact 


tggcataatt 


ggaagtatac 


180 


tgaatttctg ggttccagcg cttgctgttt tctttctgca gctcccactc gatttccacg 


240 


ctgacttgtc 


cggtgctgta 


ctgcgtgatg 


aaggaggcaa 


acttggcagg 


agtaaacacc 


300 


tccggaggat 


tagcaggtac 


aggtgtgttt 


ttgatcagga 


tctgtggagg 


cgggtgttcg 


360 


agtccaaaac 


cgcccatcag 


aggagacggg 


tgaaagttgc 


catccgtgtg 


aggaattttg 


420 


gcccagatgg 


gaccctgcag 


atacacgtcc 


cggttctgcc 


agaccatgcc 


aggcagtgct 


480 


ccctggctgt 


tgatagtctg 


tgattgaggc 


ccggctgtag 


acgactgcag 


gttgctggag 


540 


accacaccgt 


attcttctgt 


agccacaggg 


ttagtggtct 


tgatctcctc 


ctcgctggtc 


600 


atcagaacgt 


tttccagcgt 


agttttgttg 


gcagctcccg 


tcttgccaaa 


aaccagtacc 


660 


ccgttgatgg 


gaaagaactg 


gtcctcatca 


tccttgttgg 


ttgccatggg 


aatgccagga 


720 


ttggtcaaag 


agtttctgcc 


attcagatga 


tatttagtgg 


cagcagtcca 


ggcaaaattg 


780 


ctgttgttgt 


taaagtccaa 


gttctttgac 


agtctctgtt 


gcctatagca 


gggtccaggc 


840 


agccagttct 


ttgactgctc 


ggccatagta 


ttgggcccag 


cttgatgaaa 


ttgcagttcc 


900 


ctggtggaac 


ccgtggtgct 


ctgggtccgg 


gccaggtagt 


acaggtactg 


gtcgatgaga 


960 


gggttcatca 


gccggtctag 


gctctggctg 


tgcgcgtagc 


tgctgtggaa 


aggcacgtcc 


1020 


tcgaagctgt 


aactgaactc 


aaagttgttg 


cccgttctca 


gcatttgaga 


ggggaaatat 


1080 


tccaggcagt 


agaaggagga 


acggcccacc 


gattggctgc 


cgttgttcag 


agtcaggtag 


1140 


ccgtactgag 


gaatcatgaa 


gacgtccgcc 


gggaacggag 


gcaggcagcc 


ctggtgcgca 


1200 


gagccgagga 


cgtacggcag 


ctggtattcc 


gagtccgaga 


agacctgaac 


cgtgctggta 


1260 


aggttattag 


cgatggtcgt 


gacgccgtca 


ttcgttgtga 


cctccttgac 


ctggatgttg 


1320 
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aagagcttga accgcagctt cttgggccgg aatccccagt tgttgttgat gagtcgctgc 1380 

cagtcacgtg gtgagaagtg gcagtggaat ctgttaaagt caaaataccc ccagggggtg 1440 

ctgtagccga agtagtggtt gtcattggtg gctcctgagc tgctgctgga gatttgcttg 1500 

tagaggtggt tgttgtaggt ggggagggcc caggttctgg tgctggtggt gatgactctg 1560 

tcgcccagcc atgtggaatc gcaatgccaa tttcctgagg cattacccac tccgtcggca 1620 

ccttcgttat tgtctgccgt tggtgcgcca ccgcctgcag ccattgtacc agatcccaca 1680 

ctagagggcg ctgctggagg ttctccaaga ggttgagggt cggggactga ctctgagtcg 1740 

ccagtctgac caaaattgag cttcttttta gcgggctgct ggcctttttt gccgatgccc 1800 

gtggaggagt ctggagagtc tatgggtctc ttctttccag gagccgtctt agcgccttcc 1860 

tcaaccagac cgagaggttc gagaacccgc ttcttggcct ggaagactgc tcgcccgagg 1920 

ttgcccccaa atgacgtatc ttcttgcagg cgctcctgaa actcggcgtc ggcgtggtta 1980 

taccgcaggt acggattgtc acccgctttg agctgctggt cgtaggcctt gtcgtgctcg 2040 

agggccgctg cgtccgccgc gttgacgggc tcccccttgt cgagtccgtt gaagggtccg 2100 

aggtacttgt agccaggaag caccagaccc cggccgtcgt cctgcttttg ctggttggct 2160 

ttgggtttcg gggctccagg tttcaggtcc caccactcgc gaatgccctc agagaggttg 2220 

tcctcgagcc aatctggaag ataaccatcg gcagccatac ctggtttaag ccatttattg 2280 

ctcagaaaca cagtcgtcca ggtccacgtt gaccaggtcg caggccgagc aggcaatctc 2340 

gggtgcccgc cccagcagat gatgaatcgc acacagcttc cgatacgtct tttttctgac 2400 
gaccggttga gattctgaca cgccggggaa acattctaaa cagtctctga ccccgtgcgt 2460 
gaagcaaatg ttgaaattct gattcattct ctcgcacgtt ttgcagggaa acagcatctg 2520 
aagcatgccc gcgtggcgag aacatttgtt ttggtacctg tcggcaaagt ccaccggagc 2580 
tccttccgcg tctgacgtcg atgggtccgt gactgaggga caggcccgct tgggctcgct 2640 
tatatccgcg tcatcggggg cgggtctttt gctggctccg ccctttctga cgtagaactc 2700 
atgcgtcacc tcagtcacgt gatcactagc ccagcggaag aactctttga cttcctgctt 2760 
tgtcaccttg ccaaagtcgt gttccagacg gcgggtgagt tcaaatttga acatccggtc 2820 
ctgcaacggc tgctggtgct cgaaggtggt gctgttcccg tcgatcacgg cgcgcatgtt 2880 
ggtgttggag gtgacgatca cgggggtggg atcgatctgg gcggacgact tgcacttttg 2940 
gtccacgcgc accttgctgc cgccgagaat ggccttggcg gactccacga ccttggccgt 3000 
catcttgccc tcctcccacc agatcaccat cttgtcgacg caatcgttga agggaaagtt 3060 
ctcattggtc cagttgacgc agcaagggcg aattc 3095 
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<210> 25 

<211> 3142 

<212> DNA 

<213> new AAV serotype, clone H6 

<400> 25 



~t n n /-» n /*i 
daadCyaCyy 


rrr*r*arf+Tf3't"'t~ 
Lj v- Q y Ly a t L 


crtaataccrac 


tcac tat agg 


gcgaaattga 


aattagcggc 


60 


cgcgaattcg 


cctttcgcag 


agaccaaagt 


tcaactgaaa 


cgaattaaac 


ggtttattga 


12 0 


ttaacaagca 


attacagatt 


acaaatcagg 


tatctggtgc 


caatggggcg 


aggctctgaa 


180 


tacacaccat 


tagtgtccac 


agt aaagtcc 


acattaacag 


act tgttgta 


qttggaagtg 


240 


tactgaattt 


cgggattcca 


gcgtttgct g 


ttctccttct 


gcagctccca 


ctcgatctcc 


300 


acgctgacct 


gtcccgtgga 


atactgtgtg 


atgaaagaag 


caaacttggc 


agaactgaag 


360 


tttgtgggag gattggctgg 


aacgggagtg 


tttttgatca 


tgatctgagg 


aggcgggtgt 


420 


ttgagtccaa 


aacctcccat 


cagtggagaa 


ggatgaaagt 


gtccatcggt 


gtgaggaatc 


480 


ttggcccaaa 


tgggtccctg 


caggtacacg 


tctcgatcct 


gccacaccat 


accaggtaac 


540 


gctccttggt 


gattgacagt 


tccagtagtt 


ggaccagtgt 


ttgagttttg 


caaattattt 


60 0 


gacacagtcc 


cgtactgctc 


cgtagccacg ggattggtgg 


ccctgatttc 


ttcttcatct 


660 


gtaatcatga 


cattttccaa 


atccgcgtcg ttggcatttg ttccttgttt 


accaaatatc 


720 


agggttccat 


gcatggggaa 


aaacttttct 


tcgtcatcct 


tgtgactggc 


catagctggt 


780 


cctggattaa 


ccaacgagtc 


ccggccattt 


agatgatact 


ttgtagctgc 


agtccaggga 


840 


aagttgctgt 


tgttgttgtc 


gtttgcctgt 


tttgacagac 


gctgctgtct 


gtagcaaggt 


900 


ccaggcagcc 


agtttttagc 


ttgaagagac 


atgttggttg 


gtccagcttg 


gctaaacagt 


960 


agccgagact 


gctgaagagt 


tccactattt 


gtttgtgtct 


tgttcagata 


atacaggtac 


1020 


tggtcgatca gaggattcat 


cagccgatcc 


agactctggc 


tgtgagcgta 


gctgctgtgg 


1080 


aaaggcacgt 


cttcaaaagt 


gtagctgaac 


tgaaagttgt 


ttccagtacg 


cagcatctga 


1140 


gaaggaaagt 


actccaggca 


gtaaaaggaa 


gagcgtccta 


ccgcctgact 


cccgttgttc 


1200 


agggtgaggt 


atccatactg 


tgggaccatg 


aagacgtccg 


ctggaaacgg 


cgggaggcat 

13 7 7 7 3 


1260 


ccttgatgcg 


ccgagcccag 


gacgtacggg 


agctggtact 


ccgagtcagt 


aaacacctga 


1320 


accgtgctgg 


taaggttatt 


ggcaatcgtc 


gtcgtaccgt 


cattctgcgt 


gacctctttg 


1380 


acttgaatat 


taaagagctt 


gaagttgagt 


cttttgggcc 


ggaatccccg 


gttgtrgt rg 


i a A n 


acgagtcttt gccagtcacg 


tggtgaaaag 


tggcagtgga 


atctgttgaa 


gtcaaaatac 


1500 


ccccaggggg tgctgtagcc 


aaagtagtgg 


ttgtcgttgc 


tggctcctga 


ttggctggag 


1560 


atttgcttgt 


agaggtggtt 


gttgtatgtg 


ggcagggccc 


aggttcgggt 


gctggtggtg 


1620 


atgactctgt 


cgcccagcca 


ttgggaatcg 


caatgccaat 


ttcctgagga 


attacccact 


1680 


ccatcggcac 


cctcgttatt 


gtctgccatt 


ggtgcgccac 


tgcctgtagc 


cattgtagta 


1740 
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gatcccagac 


a tt a rt ft rtrt rt c 


Ly t Ly y uyyi* 


trrtcceracracr 


cr c t cr ct cr cr cr t c 


aggt aeggag 


1800 


4" 4— f~r f^t 4~ 4— m 

uCtyCy lClc 


Lay L L t y auu 


aaaat"t"taat 

UUUU L> L» L» OCA w 


ctttttcttg 


CaCTCTCtCf ctcr 


gcccgctttt 


1860 


^ 4~ 4- /-» /"^ /— < /-r 

ccgy t tcccg 


ay yayyay 


terrrctccaca 

l- y y w w * — - w <-* w u 


ggagagtgct 


ctaccggcc t 


ct tttttccc 


1920 


^ r** +• 

QyayCCy LuL 


uaauayyi/ u o 


r , tr*;%Ar*r , aef<*r 
LLuQO^ay y 


cccacracrcftt 


caagaaccc t 


ctttttcgcc 


1980 




LLLy uuuy ay 


rrttnccccca 


aaagacgtat 


c tt cttt aag 


gcgctcctga 


2040 


— > /— ► 4— 4" 

uaCtctgcyt 


uy y uy y u u 


rrtaetteracrcr 


taccrcrcrttcrt 


c teege tgtc 


aaCTCtaccCTCT 


2100 


t cgtaggcct 


ty Luy Lyu ul. 


y ay yy LiLyv^y 


cfcert c tercet 


ccrttcracccrcf 


ct cccccttg 


2160 


4— fm ^ 4"~ f r~* 

tCyay tCCy t 


ty aayyy i-^*- 


rr a rr rr t ^ r*t t* rr 
y ay y t» ai« l» u y 


tacccacfcfaa 

u# ca ^— ^ — w o*. y y Liu 


gcacaagacc 


cctgctgtcg 


2220 


t cctt atgcc 


yctctycy y y 


_ 4- 4- rr rr+" nrr f" 
l» Lityy ty y u 


n , rrtrrrrrrr , r , arf 
yy v>yyyv»*i~ay 


cr 1 1 1 cr a crc 1 1 


ccaccactgt 


2280 


cttattcctt 


cagagagagt 


gtcctcgagc 


L>aaii< l. y y aa 


frafaarrfllT 

y a l. o a > \^ a i»v« 


cr crc a crc cat a 

y y w y \* ^ w la 


2340 


cctgatttaa 


atcatttatt 


gttcagagat 


yL>ay l. i-r c* i— l> 


aaatccacat 


tgaccagatc 


2400 


gcaggcagtg caagcgtctg gcacctttcc 


cater at at era 


tgaatgt age 


acagtttctg 


2460 


atacgccttt 


ttgacgacag 


aaacgggttg 


acrattct crac 


aegggaaage 


actctaaaca 


2520 


gtctttctgt 


ccgtgagtga 


agcagatatt 


tgaattctga 


ttcattctct 


cgcattgtct 


2580 


gcagggaaac 


agcatcagat 


tcatgcccac 


gtgacgagaa 


catttgtttt 


ggtacctgtc 


2640 


cgcgtagttg 


atcgaagctt 


ccgcgtctga 


ccrtccr a tcrcrc 

w y ^* 3 w y y w 


tgcgcaactg 


actcgcgcgc 


2700 


ccgtttgggc 


tcacttatat 


ctgcgtcact 


rr cr cro er c cr cr ci t 
yyyyy^yyy ^ 


cttttcttag 


ctccaccc tt 


2760 


tttgacgtag 


aattcatgct 


ccacctcaac 


cacrrtcratcc 


tttgcccacc 


ggaaaaagtc 


2820 


tttcacttcc 


tgcttggtga 


cctttccaaa 


rr1ra1"rfai"rr 

y LLQ LLJ a LLU 


agaeggeggg 


taagttcaaa 


2880 


tttgaacatc 


cggtcttgca 


acggctgctg 


gtgctcgaag 


gtcgttgagt tcccgtcaat 


2940 


cacggcgcac 


atgttggtgt 


tggaggtgac 


gatcaeggga 


gtegggtcta 


tetgggcega 


3000 


ggacttgcat ttctggtcca 


cacgcacctt 


gcttcctcca agaatggctt tggccgactc 


3060 


cacgaccttg 


gcggtcatct 


tcccctcctc 


ccaccagatc 


accatcttgt 


egaegcaatg 


3120 


gtaaaaggaa 


agttctcatt 


gg 








3142 



<210> 26 
<211> 3075 
<212> DNA 

<213> new AAV serotype, clone H2 
<400> 26 

tgagaacttt cctttcaacg attgegtegg acaagatggt gatctggtgg gaggagggga 60 
agatgacege caaggtegtg gagteggeca aagecattet tggaggaagc aaggtgcgtg 12 0 
tggaccagaa atgeaagtec tcggcccaga tagacccgac tcccgtgatc gtcacctcca 180 
acaccaacat gtgcgccgtg attgaeggga actcaacgac cttcgagcac cagcagccgt 240 
tgcaagaccg gatgttcaaa tttgaactta cccgccgtct ggatcatgac tttggaaagg 300 
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540 
600 



tcaccaagca ggaagtgaaa gactttttcc ggtgggcaaa ggatcacgtg gttgaggtgg 360 

agcatgaatt ctacgtcaaa aagggtggag ctaagaaaag acccgccccc agtgacgcag 420 

atataagtga gcccaaacgg gcgcgcgagt cagttgcgca gccatcaacg tcagacgcgg 480 
aagcttcgat caactacgcg gacaggtacc aaaaacaaat gttctcgtca cgtgggcatg 
aatctgatgc tgtttccctg cagacaatgc gagagaatga atcagaattc aaatatctgc 

ttcactcacg gacagaaaga ctgtttagag tgctttcccg tgtcagaatc tcaacccgtt 660 

tctgtcgtca aaaaggcgta tcagaaactg tgctacattc atcatatcat gggaaaggtg 720 

ccagacgctt gcactgcctg cgatctggtc aatgtggatt tggatgactg catctctgaa 780 

caataaatga tttaaatcag gtatggctgc cgatggttat cctccagatt ggctcgagga 840 
cactctctct gaagggataa gacagtggtg gaagctcaaa cctggcccac caccaccaaa 900 
gcccgcagag cggcataagg acgacagcag gggtcttgtg cttcctgggt acaagtacct 960 

cggacccttc aacggactcg acaaggggga gccggtcaac gaggcagacg ccgcggccct 1020 

cgagcacgac aaggcctacg accggcagct cgacagcgga gacaacccgt acctcaagta 1080 

caaccacgcc gacgcagagt ttcaggagcg ccttaaagaa gatacgtctt ttgggggcaa 1140 

cctcggacga gcagtcttcc aggcgaaaaa gagggttctt gaacctctgg gcctggttga 1200 

ggaacctgtt aagacggctc cgggaaaaaa gaggccggta gagcactctc ctgtggagcc 1260 

agactcctcc tcgggaaccg gaaaagcggg ccagcggcct gcaagaaaaa gattaaattt 1320 

tggtcagact ggagacgcag actccgtacc tgacccccag cctctcggac agccaccagc 1380 

agccccctct ggtctgggat ctactacaat ggctacaggc agtggcgcac caatggcaga 1440 

caataacgag ggtgccgatg gagtgggtaa ttcctcagga aattggcatt gcgattccca 1500 

atggctgggc gacagagtca tcaccaccag cacccgaacc tgggccctgc ccacatacaa 1560 

caaccacctc tacaagcaaa tctccagcca atcaggagcc agcaacgaca accactactt 1620 

tggctacagc accccctggg ggtattttga cttcaacaga ttccactgcc acttttcacc 1680 

acgtgactgg caaagactca tcaacaacaa ctggggattc cggcccaaaa gactcaactt 1740 

caagctcttt aatattcaag tcaaagaggt cacgcagaat gacggtacga cgacgattgc 1800 

caataacctt accagcacgg ttcaggtgtt tactgactcg gagtaccagc tcccgtacgt 1860 

cctgggctcg gcgcatcaag gatgcctccc gccgtttcca gcggacgtct tcatggtccc 1920 

acagtatgga tacctcaccc tgaacaacgg gagtcaggcg gtaggacgct cttcctttta 1980 
ctgcctggag tactttcctt ctcagatgct gcgtactgga aacaactttc agttcagcta 2040 
cacttttgaa gacgtgcctt tccacagcag ctacgctcac agccagagtc tggatcggct 2100 
gatgaatcct ctgatcgacc agtacctgta ttatctgaac aagacacaaa caaatagtgg 2160 
aactcttcag cagtctcggc tactgtttag ccaagctgga ccaaccaaca tgtctcttca 2220 
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agctaaaaac tggctgcctg gaccttgcta cagacagcag cgtctgtcaa aacaggcaaa 2280 

cgacaacaac aacagcaact ttccctggac tgcagctaca aagtatcatc taaatggccg 2340 

ggactcgttg gttaatccag gaccagctat ggccagtcac aaggatgacg aagaaaagtt 2 400 

tttccccatg catggaaccc tgatatttgg taaacaagga acaaatgcca acgacgcgga 2460 

tttggaaaat gtcatgatta cagatgaaga agaaatcagg gccaccaatc ccgtggctac 2520 

ggagcagtac gggactgtgt caaataattt gcaaaactca aacactggtc caactactgg 2580 

aactgtcaat cgccaaggag cgttacctgg tatggtgtgg caggatcgag acgtgtacct 2640 

gcagggaccc atttgggcca agattcctca caccgatgga cactttcatc cttctccact 2700 

gatgggaggt tttggactca aacacccgcc tcctcagatc atgatcaaaa acactcccgt 2760 

tccagccaat cctcccacaa acttcagttc tgccaagttt gcttctttca tcacacagta 2820 

ttccacggga caggtcagcg tggagatcga gtgggagctg cagaaggaga acagcaaacg 2880 



2940 
3000 
3060 
3075 



ctggaatccc gaaattcagt acacttccaa ctacaacaag tctgttaatg tggactttac 
tgtggacact aatggtgtgt attcagagcc tcgccccatt ggcaccagat acctgactcg 
taatctgtaa ttgcttgtta atcaataaac cgtttaattc gtttcagttg aactttggtc 
tctgcgaagg gcgaa 

<210> 27 
<211> 3128 
<212> DNA 

<213> new AAV serotype, clone 42.8 
<400> 27 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 
gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 12 0 
ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcttccg 180 
cccagatcga tcccaccccc gtgatcgtca cttccaacac caacatgtgc gccgtgattg 
acgggaacag caccaccttc gagcaccagc agccgttaca agaccggatg ttcaaatttg 
aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcaaagagt 360 
tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 
gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 48 0 
gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 



240 
300 



600 
660 



acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 
agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 
gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 
ggaaactctg tgccattcat catctgctag ggcgggctcc cgagattgct tgctcggcct 780 
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gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 840 

ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 

cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagaccggt agagccatca ccccagcgtt ctccagactc ctctacgggc 1320 

atcggcaaga caggccagca gcccgcgaaa aagagactca actttgggca gactggcgac 1380 

tcagagtcag tgcccgaccc tcaaccaatc ggagaacccc ccgcaggccc ctctggtctg 1440 

ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 1560 

gtcatcacca ccagcacccg aacctgggcc ctccccacct acaacaacca cctctacaag 1620 

caaatctcca acgggacatc gggaggaagc accaacgaca acacctactt cggctacagc 1680 

accccctggg ggtattttga ctttaacaga ttccactgcc acttctcacc acgtgactgg 1740 

cagcgactca tcaacaacaa ctggggattc cggcccaaga gactcaactt caagctcttc 1800 

aacatccagg tcaaggaggt cacgcagaat gaaggcacca agaccatcgc caataacctt 1860 

accagcacga ttcaggtctt tacggactcg gaataccagc tcccgtacgt cctcggctct .1920 

gcgcaccagg gctgcctgcc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 1980 

tacctgactc tgaacaacgg cagtcaggcc gtgggccgtt cctccttcta ctgcctggag 2040 

tactttcctt ctcaaatgct gagaacgggc aacaactttg agttcagcta ccagtttgag 2100 

gacgtgcctt ttcacagcag ctacgcgcac agccaaagcc tggaccggct gatgaacccc 2160 

ctcatcgacc agtacctgta ctacctgtct cggactcagt ccacgggagg taccgcagga 2220 

actcagcagt tgctattttc tcaggccggg cctaataaca tgtcggctca ggccaaaaac 2280 

tggctacccg ggccctgcta ccggcagcaa cgcgtctcca cgacactgtc gcaaaataac 2340 

aacagcaact ttgcttggac cggtgccacc aagtatcatc tgaatggcag agactctctg 2400 

gtaaatcccg gtgtcgctat ggcaacgcac aaggacgacg aagagcgatt ttttccatcc 2460 

agcggagtct tgatgtttgg gaaacaggga gctggaaaag acaacgtgga ctatagcagc 2520 

gttatgctaa ccagtgagga agaaatcaaa accaccaacc cagtggccac agaacagtac 258 0 

ggcgtggtgg ccgataacct gcaacagcaa aacgccgctc ctattgtagg ggccgtcaac 2 640 

agtcaaggag ccttacctgg catggtctgg cagaaccggg acgtgtacct gcagggtcct 2700 
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3000 
3060 



atctgggcca agattcctca cacggacggc aactttcatc cttcgccgct gatgggaggc 2760 

tttggactga aacacccgcc tcctcagatc ctgattaaga atacacctgt tcccgcggat 2820 

cctccaacta ccttcagtca agccaagctg gcgtcgttca tcacgcagta cagcaccgga 2880 

caggtcagcg tggaaattga atgggagctg cagaaagaga acagcaagcg ctggaaccca 2940 
gagattcagt atacttccaa ctactacaaa tctacaaatg tggactttgc tgtcaatact 
gagggtactt attcagagcc tcgccccatt ggcacccgtt acctcacccg taacctgtaa 

ttgcctgtta atcaataaac cggctaattc gtttcagttg aactttggtc tctgcgaagg 3120 

gcgaattc 3128 

<210> 28 
<211> 3128 
<212> DNA 

<213> new AAV serotype, clone 42.15 
<400> 28 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 180 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaaatttg 300 

aactcacccg ccgtctggag catgactttg gcaaggtgac aaagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

agacatgcga gagaatgaat cagaatttca acatttgctt cacgcgcggg accagagact 660 

gttcagaatg tttcccgggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 

ggaaactctg tgccattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 780 

gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 840 

ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 

cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 12 60 
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cctggaaaga agagaccggt agagccatca ccccagcgtt ctccagactc ctctacgggc 1320 

atcggcaaga caggccagca gcccgcgaaa aagagactca actttgggca gactggcgac 1380 

tcagagtcag tgcccgaccc tcaaccaatc ggagaacccc ccgcaggccc ctctggtctg 1440 

ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 1560 

gtcatcacca ccagcacccg aacctgggcc ctccccacct acaacaacca cctctacaag 1620 

caaatctcca acgggacatc gggaggaagc accaacgaca acacctactt cggctacagc 1680 

accccctggg ggtattttga ctttaacaga ttccactgcc acttctcacc acgtgactgg 1740 

cagcgactca tcaacaacaa ctggggattc cggcccaaga gactcaactt caagctcttc 1800 

aacatccagg tcaaggaggt cacgcagaat gaaggcacca agaccatcgc caataacctt 18 60 

accagcacga ttcaggtctt tacggactcg gaataccagc tcccgtacgt cctcggctct 1920 

gcgcaccagg gctgcccgcc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 1980 

tacctgactc tgaacaacgg cagtcaggcc gtgggccgtt cctccttcta ctgcctggag 2040 

tactttcctt ctcaaatgcg gagaacgggc aacaactttg agttcagcta ccagtttgag 2100 

gacgtgcctt ttcacagcag ctacgcgcat agccaaagcc tggaccggct gatgaacccc 2160 

ctcatcgacc agtacctgta ctacctgtct cggactcagt ccacgggagg taccgcagga 2220 

actcagcagt tgctattttc tcaggccggg cctaataaca tgtcggctca ggccaaaaac 2280 

tggctacccg ggccctgcta ccggcagcaa cgcgtctcca cgacactgtc gcaaaataac 2340 

aacagcaact ttgcttggac cggtgccacc aagtatcatc tgaatggcag agactctctg 2400 

gtaaatcccg gtgtcgctat ggcaacgcac aaggacgacg aagagcgatt ttttccatcc 2460 

agcggagtct tgatgtttgg gaaacaggga gctggaaaag acaacgtgga ctatagcagc 2520 

gttatgctaa ccagtgagga agaaatcaaa accaccaacc cagtggccac agaacagtac 2580 

ggcgtggtgg ccgataacct gcaacagcaa aacgccgctc ctattgtagg ggccgtcaac 2640 

agtcaaggag ccttacctgg catggtctgg cagaaccggg acgtgtacct gcagggtcct 2700 

atctgggcca agattcctca cacggacggc aactttcatc cttcgccgct gatgggaggc 2760 

tttggactga aacacccgcc tcctcagatc ctgattaaga atacacctgt tcccgcggat 2820 

cctccaacta ccttcagtca' agccaagctg gcgtcgttca tcacgcagta cagcaccgga 2880 

caggtcagcg tggaaattga atgggagctg cagaaagaga acagcaagcg ctggaaccca 2940 

gagattcagt atacttccaa ctactacaaa tctacaaatg tggactttgc tgtcaatact 3000 

gagggtactt attcagagcc tcgccccatt ggcacccgtt acctcacccg taacctgtaa 3060 

ttgcctgtta atcaataaac cggttaattc gtttcagttg aactttggtc tctgcgaagg 3120 

gcgaattc 3128 
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<210> 29 
<211> 3197 
<212> DNA 

<213> new AAV serotype, clone 42.5b 
<400> 29 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 18 0 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 24 0 

acgggaacag caccaccttc gagcaccagc agccgttaca agaccggatg ttcaaatttg 300 

aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 

ggaaactctg tgccattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 780 

gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 840 

ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaagggag agccggtcaa cgaggcagac gccgcggccc tcgagcacga caaggcctac 1080 

gacaagcagc tcgagcaggg ggacaacccg tacctcaagt acaaccacgc cgacgccgag 1140 

tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagaccggt agagccatca ccccagcgtt ctccagactc ctctacgggc 1320 

atcggcaaga caggccagca gcccgcgaaa aagagactca actttgggca gactggcgac 1380 

tcagagtcag tgcccgaccc tcaaccaatc ggagaacccc ccgcaggccc ctctggtctg 1440 

ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 1560 

gtcatcacca ccagcacccg aacctgggcc ctccccacct acaacaacca cctctacaag 1620 

caaatctcca acgggacatc gggaggaagc accaacgaca acacctactt cggctacagc 1680 

accccctggg ggtattttga ctttaacaga ttccactgcc acttctcacc acgtgactgg 1740 
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cagcgactca 


tcaacaacaa 


ctggggattc cggcccaaga 


gactcaactt 


caagctcttc 


1800 


aacatccacrq 


tcaaggaggt 


cacgcagaat gaaggcacca 


agaccatcgc 


caataacctt 


1860 


accagcacga 


ttcaggtctt 


tacggactcg gaataccagc 


tcccgtacgt 


cctcggctct 


1920 


gcgcaccagg 


gctgcctgcc 


tccgttcccg gcggacgtct 


tcatgattcc 


tcagtacggg 


1980 


tacctgactc 


tgaacaacgg 


cagtcaggcc gtgggccgtt 


cctccttcta 


ctgcctggag 


2040 


tactttcctt 


ctcaaatgct 


gagaacgggc aacaactttg 


agttcagcta 


ccagtttgag 


2100 


gacgtgcctt 


ttcacagcag 


ctacgcgcac agccaaagcc 


tggaccggct 


gatgaacccc 


2160 


ctcatcgacc 


agtacctgta 


ctacctgtct cggactcagt 


ccacgggagg 


taccgcagga 


2220 


act cagcagt 


tgctattttc 


tcaggccggg cctaataaca 


tgtcggctca 


ggccaaaaac 


2280 


tggctacccg 


ggccctgcta 


ccggcagcaa cgcgtctcca 


cgacactgtc 


gcaaaataac 


2340 


aacagcaact 


ttgcttggac 


cggtgccacc aagtatcatc 


tgaatggcag 


agactctctg 


2400 


gtaaatcccg 


gtgtcgctat 


ggcaacgcac aaggacgacg 


aagagcgatt 


ttttccatcc 


2460 


agcqqaqtct 


tgatgtttgg 

7 7 7 7 


gaaacaggga gctggaaaag 


acaacgtgga 


ctatagcagc 


2520 


gttatgctaa 


ccagtgagga 


agaaatcaaa accaccaacc 


cagtggccac 


agaacagtac 


2580 


aacataqtqcr 


ccgataacct 


gcaacagcaa aacgccgctc 


ctattgtagg 


ggccgtcaac 


2640 


agt caaggag 


ccttacctgg 


catggtctgg cagaaccggg 


acgtgtacct 


gcagggtcct 


2700 


atctgggcca 


agattcctca 


cacggacggc aactttcatc 

7 7 7 7 


cttcgccgct 


gatgggaggc 


2760 


tttggactga 


aacacccgcc 


tcctcagatc ctgattaaga 


atacacctgt 


tcccgcggat 


2820 


cctccaacta 


ccttcagtca 


agccaagctg gcgtcgttca 

7 7 7 7 — ' 


tcacgcagta 


cagcaccgga 


2880 


caggtcagcg 


tggaaattga 


atgggagctg cagaaagaga 


acagcaagcg 


ctggaaccca 


2940 


gagattcagt 


atacttccaa 


ctactacaaa tctacaaatg 


tggactttgc 


tgtcaatact 


3000 


gagggtactt attcagagcc tcgccccatt ggcacccgtt acctcacccg taacctgtaa 


3060 


ttgcctgtta 


atcaataaac 


cggttaattc gtttcagttg 


aactttggtc 


tctgcgaagg 


3120 


gcgaattcgt 


ttaaacctgc 


aggactagtc cctttagtga 


gggttaattc 


tgagcttggc 


3180 


gtaatcatgg 


gtcatag 








3197 



<210> 30 
<211> 2501 
<212> DNA 

<213> new AAV serotype, clone 42.1b 
<400> 30 

gaattcgccc ttggctgcgt caactggacc aatgagaact ttcccttcaa cgattgcgtc 
gacaagatgg tgatctggtg ggaggagggc aagatgacgg ccaaggtcgt ggagtccgcc 
aaggccattc atcatctgct ggggcgggct cccgagattg cttgctcggc ctgcgatctg 
gtcaacgtgg acctggatga ctgtgtttct gagcaataaa tgacttaaac caggtatggc 
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tgccgatggt tatcttccag attggctcga ggacaacctc tctgagggca ttcgcgagtg 300 

gtgggacttg agacctggag ccccgaaacc caaagccaac cagcaaaagc aggacgacgg 360 

ccggggtctg gtgcttcctg gctacaagta cctcggaccc ttcaacggac tcgacaaggg 420 

agagccggtc aacgaggcag acgccgcggc cctcgagcac gacaaggcct acgacaagca 480 

gctcgagcag ggggacaacc cgtacctcaa gtacaaccac gccgacgccg agtttcagga 540 

gcgtcttcaa gaagatacgt cttttggggg caacctcggg cgagcagtct tccaggccaa 600 

gaagcgggtt ctcgaacctc tcggtctggt tgaggaaggc gctaagacgg ctcctggaaa 660 

gaagagaccc atagaatccc ccgactcctc cacgggcatc ggcaagaaag gccagcagcc 720 

cgctaaaaag agactcaact ttgggcagac tggcgactca gagtcagtgc ccgaccctca 780 
accaatcgga gaaccccccg caggcccctc tggtctggga tctggcacaa tggctgcagg 840 
cggtggcgct ccaatggcag acaataacga aggcgccgac ggagtgggta gttcctcagg 900 
aaattggcat tgcgattcca catggctggg cgacagagtc atcaccacca gcacccgaac 960 

ctgggccctc cccacctaca acaaccacct ctacaagcaa atctccaacg ggacatcggg 1020 

aggaagcacc aacgacaaca cctacttcgg ctacagcacc ccctgggggt attttgactt 108 0 

taacagattc cactgccact tctcaccacg tgactggcag cgactcatca acaacaactg 1140 

gggattccgg cccaagagac tcaacttcaa gctcttcaac atccaggtca aggaggtcac 1200 

gcagaatgaa ggcaccaaga ccatcgccaa taaccttacc agcacgattc aggtctttac 1260 

ggactcggaa taccagctcc cgtacgtcct cggctctgcg caccagggct gcctgcctcc 1320 

gttcccggcg gacgtcttca tgattcctca gtacgggtac ctgactctga acaacggcag 1380 

tcaggccgtg ggccgttcct ccttctactg cctggagtac tttccttctc aaatgctgag 1440 

aacgggcaac aactttgagt tcagctacca gtttgaggac gtgccttttc acagcagcta 1500 

tgcgcacagc caaagcctgg accggctgat gaaccccctc atcgaccagt acctgtacta 1560 

cctgtctcgg actcagtcca cgggaggtac cgcaggaact cagcagttgc tattttctca 1620 

ggccgggcct aataacatgt cggctcaggc caaaaactgg ctacccgggc cctgctaccg 1680 

gcagcaacgc gtctccacga cagtgtcgca aaataacaac agcaactttg cttggaccgg 1740 

tgccaccaag tatcatctga atggcagaga ctctctggta aatcccggtg tcgctatggc 1800 

aacgcacaag ggcgacgaag agcgattttt tccatccagc ggagtcttga tgtttgggaa 1860 

acagggagct ggaaaagaca acgtagacta tagcagcgtt atgctaacca gtgaggaaga 1920 

aatcaaaacc accaacccag tggccacaga acagtacggc gtggtggccg ataacctgca 1980 
acagcaaaac gccgctccta ttgtaggggc cgtcaacagt caaggagcct tacctggcat 2040 
ggtctggcag aaccgggacg tgtacctgca gggtcctatc tgggccaaga ttcctcacac 2100 
ggacggcaac tttcatcctt cgccgctgat gggaggcttt ggactgaaac acccgcctcc 2160 
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tcagatcctg attaagaata cacctgttcc cgcggatcct ccaactacct tcagtcaagc 2220 

caagctggcg tcgttcatca cgcagtacag caccggacag gtcagcgtgg aaattgaatg 2280 

ggagctgcag aaagagaaca gcaagcgctg gaacccagag attcagtata cttccaacta 2340 

ctacaaatct acaaatgtgg actttgctgt caatactgag ggtacttatt cagagcctcg 2400 

ccccattggc acccgttacc tcacccgtaa cctgtaattg cctgttaatc aataaaccgg 2460 

ttgattcgtt tcagttgaac tttggtctca agggcgaatt c 2501 

<210> 31 

<211> 3113 

<212> DNA 

<213> new AAV serotype, clone 42.13 



<400> 31 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 



60 



180 
240 
300 



600 
660 



gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 
ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 
cccagatcga tcccaccccc gtgategtca cttccaacac caacatgtgc gccgtgattg 
acgggaacag caccaccttc gagcaccagc agccgttaca agaccggatg ttcaaatttg 
aactcacccg ccgtctggag catgactttg gcaaggtgac aaagcaggaa gtcaaagagt 360 
tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 
gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 
gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 
acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 
agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 
gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 
ggaaactctg tgccattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 780 
gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 
ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 
gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 
gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 
gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 
tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 
caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 12 60 
cctggaaaga agagacccat agaatccccc gactcctcca cgggcatcgg caagaaaggc 1320 
cagcagcccg ctaaaaagaa gctcaacttt gggcagactg gcgactcaga gtcagtgccc 1380 



840 
900 
960 
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gaccctcaac caatcggaga accccccgca ggcccctctg gtctgggatc tggtacaatg 1440 

gctgcaggcg gtggcgctcc aatggcagac aataacgaag gcgccgacgg agtgggtagt 1500 

tcctcaggaa attggcattg cgattccaca tggctgggcg acagagtcat caccaccagc 1560 

acccgaacct gggccctccc cacctacaac aaccacctct acaagcaaat ctccaacggg 1620 

acatcgggag gaagcaccaa cgacaacacc tacttcggct acagcacccc ctgggggtat 1680 

tttgacttta acagattcca ctgccacttc tcaccacgtg actggcagcg actcatcaac 1740 

aacaactggg gattccggcc caagagactc aacttcaagc tcttcaacat ccaggtcaag 1800 

gaggtcacgc agaatgaagg caccaagacc atcgccaata accttaccag cacgattcag 18 60 

gtctttacgg actcggaata ccagctcccg tacgtcctcg gctctgcgca ccagggctgc 1920 

ctgcctccgt tcccggcgga cgtcttcatg attcctcagt acgggtacct gactctgaac 1980 

aacggcagtc aggccgtggg ccgttcctcc ttctactgcc tggagtactt tccttctcaa 2040 

atgctgagaa cgggcaacaa ctttgagttc agctaccagt ttgaggacgt gccttttcac 2100 

agcagctatg cgcacagcca aagcctggac cggctgatga accccctcat cgaccagtac 2160 

ctgtactacc tgtctcggac tcagtccacg ggaggtaccg caggaactca gcagttgcta 2220 

ttttctcagg ccgggcctaa taacatgtcg gctcaggcca aaaactggct acccgggccc 2280 

tgctaccggc agcaacgcgt ctccacgaca gtgtcgcaaa ataacaacag caactttgct 2340 

tggaccggtg ccaccaagta tcatctgaat ggcagagact ctctggtaaa tcccggtgtc 2400 

gctatggcaa cgcacaaggg cgacgaagag cgattttttc catccagcgg agtcttgatg 2460 

tttgggaaac agggagctgg aaaagacaac gtggactata gcagcgttat .gctaaccagt 2520 

gaggaagaaa tcaaaaccac caacccagtg gccacagaac agtacggcgt ggtggccgat 2580 

aacctgcaac agcaaaacgc cgctcctatt gtaggggccg tcaacagtca aggagcctta 2640 

cctggcatgg tctggcagaa ccgggacgtg tacctgcagg gtcctatctg ggccaagatt 2700 

cctcacacgg acggcaactt tcatccttcg ccgctgatgg gaggctttgg actgaaacac 2760 

ccgcctcctc agatcctgat taagaataca cctgttcccg cggatcctcc aactaccttc 2820 

agtcaagcca agctggcgtc gttcatcacg cagtacagca ccggacaggt cagcgtggaa 2 88 0 

attgaatggg agctgcagaa agagaacagc aagcgctgga acccagagat tcagtatact 2 940 

tccaactact acaaatctac aaatgtggac tttgctgtca atactgaggg tacttattca 3000 

gagcctcgcc ccattggcac ccgttacctc acccgtagcc tgtaattgcc tgttaatcaa 3060 

taaaccggtt gattcgtttc agttgaactt tggtctctgc gaagggcgaa ttc 3113 

<210> 32 
<211> 3113 
<212> DNA 

<213> new AAV serotype, clone 42.3a 
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<400> 32 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 180 

cccagatcga tcccaccccc gtgatcgtca cttccaacac caacatgtgc gccgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttaca agaccggatg ttcaaatttg 300 

aactcacccg ccgtctggag catgactttg gcaaggtgac aaagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg cttccctgca 600 

agacatgcga gagaatgaat cagaatttca gcatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 

ggaaactctg tgccattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 7 80 

gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 840 

ggtatggctg ccgatggtca tcttccagat tggctcgagg acaacctctc tgagggcatt 900 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 

tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagacccat agaatccccc gactcctcca cgggcatcgg caagaaaggc 1320 

cagcagcccg ctaaaaagaa gctcaacttt gggcagactg gcgactcaga gtcagtgccc 1380 

gaccctcaac caatcggaga accccccgca ggcccctctg gtctgggatc tggtacaatg 1440 

gctgcaggcg gtggcgctcc aatggcagac aataacgaag gcgccgacgg agtgggtagt 1500 

tcctcaggaa attggcattg cgattccaca tagctgggcg acagagtcat caccaccagc 1560 

acccgaacct gggccctccc cacctacaac aaccacctct acaagcaaat ctccaacggg 1620 

acatcgggag gaagcaccaa cgacaacacc tacttcggct acagcacccc ctgggggtat 1680 

tttgacttta acagattcca ctgccacttc tcaccacgtg actggcagcg actcatcaac 1740 

aacagctggg gattccggcc caagagactc aacttcaagc tcttcaacat ccaggtcaag 1800 

gaggtcacgc agaatgaagg caccaagacc atcgccaata accttaccag cacgattcag 1860 

gtctttacgg actcggaata ccagctcccg tacgtcctcg gctctgcgca ccagggctgc 1920 
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ctgcctccgt tcccggcgga cgtcttcatg attcctcagt acgggtacct gactctgaac 1980 

aacggcagtc aggccgtggg ccgttcctcc ttctactgcc tggagtactt tccttctcaa 2040 

atgctgagaa cgggcaacaa ctttgagttc agctaccagt ttgaggacgt gccttttcac 2100 

agcagctacg cgcacagcca aagcctggac cggctgatga accccctcat cgaccagtac 2160 

ctgtactacc tgtctcggac tcagtccacg ggaggtaccg caggaactca gcagttgcta 2220 

ttttctcagg ccgggcctaa taacatgtcg gctcaggcca aaaactggct acccgggccc 2280 

tgctaccggc agcaacgcgt ctccacgaca ctgtcgcaaa ataacaacag caactttgct 2340 

tggaccggtg ccaccaagta tcatctgaat ggcagagact ctctggtaaa tcccggtgtc 2400 

gctatggcaa cgcacaagga cgacgaagag cgattttttc catccagcgg agtcttgatg 2460 

tttgggaaac agggagctgg aaaagacaac gtggactata gcagcgttat gctaaccagt 2520 

gaggaagaaa tcaaaaccac caacccagtg gccacagaac agtacggcgt ggtggccgat 2580 

aacctgcaac agcaaaacgc cgctcctatt gtaggggccg tcaacagtca aggagcctta 2 640 

cctggcatgg tctggcagaa ccgggacgtg tacctgcagg gtcctatctg ggccaagatt 2700 

cctcacacgg acggcaactt tcatccttcg ccgctgatgg gaggctttgg actgaaacac 2760 

ccgcctcctc agatcctgat taagaataca cctgttcccg cggatcctcc aactaccttc 2820 

agtcaagcca agctggcgtc gttcatcacg cagtacagca ccggacaggt cagcgtggaa 2880 

attgaatggg agctgcagaa agagaacagc aagcgctgga acccagagat tcagtatact 2940 

tccaactact acaaatctac aaatgtggac tttgctgtca atactgaggg tacttattca 3000 

gagcctcgcc ccattggcac ccgttacctc acccgtaacc tgtaattgcc tgttaatcaa 3060 

taaaccggtt aattcgtttc agttgaactt tggtctctgc gaagggcgaa ttc 3113 

<210> 33 
<211> 2504 
<212> DNA 

<213> new AAV serotype, clone 42.4 
<400> 33 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattcatcat ctgctggggc gggctcccga gattgcttgc tcggcctgcg 180 

atctggtcaa cgtggacctg gatgactgtg tttctgagca ataaatgact taaaccaggt 240 

atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 300 

gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 360 

gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 420 

aagggagagc cggtcaacga ggcagacgcc gcggccctcg agcacgacaa ggcctacgac 480 

aagcagctcg agcaggggga caacccgtac ctcaagtaca accacgccga cgccgagttt 540 
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caggagcgtc ttcaagaaga tacgtctttt gggggcaacc tcgggcgagc. agtcttccag 600 

gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 660 

ggaaagaaga gacccataga atcccccgac tcctccacgg gcatcggcaa gaaaggccag 720 

cagcccgcta aaaagaagct caactttggg cagactggcg actcagagtc agtgcccgac 780 

cctcaaccaa tcggagaacc ccccgcaggc ccctctggtc tgggatctgg tacaatggct 840 

gcaggcggtg gcgctccaat ggcagacaat aacgaaggcg ccgacggagt gggtaatgcc 900 
tccggaaatt ggcattgcga ttccacatgg ctgggcgaca gagtcatcac caccagcacc 960 

cgcacctggg ccctgcccac ctacaacaac cacctctaca agcagatatc aagtcagagc 1020 

ggggctacca acgacaacca cttcttcggc tacagcaccc cctggggcta ttttgacttc 1080 

aacagattcc actgccactt ctcatcacgt gactggcagc gactcatcaa caacaactgg 1140 

ggattccggc ccaagagact caacttcaag ctcttcaaca tccaggtcaa ggaggtcacg 1200 

cagaatgaag gcaccaagac catcgccaat aaccttacca gcacgattca ggtctttacg 1260 

gactcggaat accggctccc gtacgtcctc ggctctgcgc accagggctg cctgcctccg 1320 

ttcccggcgg acgtcttcat gattcctcag tacgggtacc tgactctgaa caacggcagt 138 0 

caggccgtgg gccgttcctc cttctactgc ctggagtact ttccttctca aatgctgaga 1440 

acgggcaaca actttgagtt cagctaccag tttgaggacg tgccttttca cagcagctac 1500 

gcgcacagcc aaagcctgga ccggctgatg aaccccctca tcgaccagta cctgtactac 1560 

ctgtctcgga ctcagtccac gggaggtacc gcaggaactc agcagttgct attttctcag 1620 

gccgggccta ataacatgtc ggctcaggcc aaaaactggc tacccgggcc ctgctaccgg 1680 

cagcaacgcg tctccacgac actgtcgcaa aataacaaca gcaactttgc ttggaccggt 17 40 

gccaccaagt atcatctgaa tggcagagac tctctggtaa atcccggtgt cgctatggca 1800 

acgcacaagg acgacgaaga gcgatttttt ccatccagcg gagtcttgat gtttgggaaa 1860 

cagggagctg gaaaagacaa cgtggactat agcagcgtta tgctaaccag tgaggaagaa 1920 

atcaaaacca ccaacccagt ggccacagaa cagtacggcg tggtggccga taacctgcaa 1980 

cagcaaaacg ccgctcctat tgtaggggcc gtcaacagtc aaggagcctt acctggcatg 2040 

gtctggcaga accgggacgt gtacctgcag ggtcctatct gggccaagat tcctcacacg 2100 

gacggcaact ttcatccttc gccgctgatg ggaggctttg gactgaaaca cccgcctcct 2160 

cagatcctga ttaagaatac acctgttccc gcggatcctc caactacctt cagtcaagcc 2220 
aagccggcgt cgttcatcac gcagtacagc accggacagg tcagcgtgga aattgaatgg 2280 
gagctgcaga aagagaacag caagcgctgg aacccagaga ttcagtatac ttccaactac 2340 
tacaaatcta caaatgtgga ctttgctgtc aatactgagg gtacttattc agagcctcgc 2400 
cccattggca cccgttacct cacccgtaac ctgtaattgc ctgttaatca ataaaccggt 2460 
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taattcgttt cagttgaact ttggtctctg cgaagggcga attc 2504 

<210> 34 

<211> 3106 

<212> DNA 

<213> new AAV serotype, clone 42.5a 

<400> 34 



gaattcgccc 


ttctacggct 


gcgtcaactg 


gaccaatgag 


aactttccct 


tcaacgattg 


60 


cgtcgacaag 


atggtgatct 


ggtgggagga 


gggcaagatg 


acggccaagg 


tcgtggagtc 


120 


cgccaaggcc 


attctcggcg 


gcagcaaggt 


gcgcgtggac 


caaaagtgca 


agtcgtccgc 


180 


ccagatcgac 


cccacccccg 


tgatcgtcac 


ctccaacacc 


aacatgtgcg 


ccgtgattga 


240 


cgggaacagc 


accaccttcg 


agcaccagca 


gccgttgcag 


gaccggatgt 


tcaaatttga 


300 


actcacccgc 


cgtctggagc 


atgactttgg 


caaggcgaca 


aagcaggaag < 


tcaaagagtt 


360 


cttccgctgg 


gcgcaggatc 


acgtgaccga 


ggtggcgcat 


gagttctacg 


tcagaaaggg 


420 


tggagccaac 


aagagacccg 


cccccgatga 


cgcggataaa 


agcgagccca 


agcgggcccg 


480 


cccctcagtc 


gcggatccat 


cgacgtcaga 


cgcggaagga 


gctccggtgg 


actttgccga 


540 


caggtaccaa 


aacaaatgtt 


ctcgtcacgc 


gggcatgctt 


cagatgctgt 


ttccctgcaa 


600 


aacatgcgag 


agaatgaatc 


agaatttcaa 


catttgcttc 


acgcacggga 


ccagagactg 


660 


ttcagaatgt 


ttccccggcg 


tgtcagaatc 


tcaaccggtc 


gtcagaaaga 


ggacgtatcg 


720 


gaaactctgt 


gccattcatc 


atctgctggg 


gcgggctccc 


gagattgctt 


gctcggcctg 


780 


cgatctggtc 


aacgtggacc 


tggatgactg 


tgtttctgag 


caataaatga 


cttaaaccag 


840 


gtatggctgc 


cgatggttat 


cttccagatt 


ggctcgagga 


caacctctct 


gagggcattc 


900 


gcgagtggtg 


ggacttgaaa 


cctggagccc 


cgaaacccaa 


agccaaccag 


caaaagcagg 


960 


acgacggccg 


gggtctggtg 


cttcctggct 


acaagtacct 


cggacccttc 


aacggactcg 


1020 


acaagggaga 


gccggtcaac 


gaggcagacg 


ccgcggccct 


cgagcacgac 


aaggcctacg 


1080 


acaagcagct 


cgagcagggg 


gacaacccgt 


acctcaagta 


caaccacgcc 


gacgccgagt 


1140 


ttcaggagcg 


tcttcaagaa 


gatacgtctt 


ttgggggcaa 


cctcgggcga 


gcagtcttcc 


1200 


gggccaagaa 


gcgggttctc 


gaacctctcg 


gtctggttga 


ggaaggcgct 


aagacggctc 


1260 


ctggaaagaa 


gagacccata 


gaatcccccg 


actcctccac 


gggcatcggc 


aagaaaggcc 


1320 


agcagcccgc 


taaaaagaag 


ctcaactttg 


ggcagactgg 


cgactcagag 


tcagtgcccg 


1380 


acccccaacc 


tctcggagaa 


cctcccgccg 


cgccctcagg 


tctgggatct 


ggtacaatgg 


1440 


ctgcaggcgg 


tggcgcacca 


atggcagaca 


ataacgaagg 


cgccgacgga 


gtgggtaatg 


1500 


cctccggaaa 


ttggcattgc 


gattccacat 


ggctgggcga 


cagagtcatc 


accaccagca 


1560 


cccgcacctg 


ggccctgccc 


acctacaaca 


accacctcta 


caagcagata 


tcaagtcaga 


1620 


gcggggctac 


caacgacaac 


cacttcttcg 


gctacagcac 


cccctggggc 


tattttgact 


1680 
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tcaacagatt 


ccactgccac 


ttctcaccac 


gtgactggca 


gcgactcatc 


aacaacaacc 


1740 


ggggattccg 


gcccagaaag 


ctgcggttca 


agttgttcaa 


catccaggtc 


aaggaggtca 


1800 


cgacgaacga 


cggcgttacg 


accatcgcta 


ataaccttac 


cagcacgatt 


caggtcttct 


1860 


cggactcgga 


gtaccaactg 


ccgtacgtcc 


tcggctctgc 


gcaccagggc 


tgcctccctc 


1920 


cgttccctgc 


ggacgtgttc 


atgattcctc 


agtacggata 


tctgactcta 


aacaacggca 


1980 


gtcagtctgt 


gggacgttcc 


tccttctact 


gcctggagta 


ctttccttct 


cagatgctga 


2040 


gaacgggcaa 


taactttgaa 


ttcagctacc 


agtttgagga 


cgtgcccttt 


cacagcagct 


2100 


acgcgcacag 


ccaaagcctg 


gaccggctga 


tgaaccccct 


catcgaccag 


tacctgtact 


2160 


acctgtctcg 


gactcagtcc 


acgggaggta 


ccgcaggaac 


tcagcagttg 


ctattttctc 


2220 




taataacatg 


tcggctcagg 


ccaaaaactg 


gctacccggg 


ccctgctacc 


2280 


ggcagcaacg 


cgtctccacg 


acactgtcgc 


aaaataacaa 


cagcaacttt 


gcttggaccg 


2340 


gtgccaccaa 


gtatcatctg 


aatggcagag 


actctctggt 


aaatcccggt 


gtcgctatgg 


2400 


caacgcacaa 


ggacgacgaa 


gagcgatttt ttccatccag 


cggagtcttg 


atgtttggga 


2460 


aacagggagc 


tggaaaagac 


aacgtggact 


atagcagcgt 


tatgctaacc 


agtgaggaag 


2520 


aaatcaaaac 


caccaaccca 


gtggccacag 


aacagtacgg 


cgtggtggcc 


gataacctgc 


2580 


aacagcaaaa 


cgccgctcct 


attgtagggg 


ccgtcaacag 


tcaaggagcc 


ttacctggca 


2640 


taacctgcrca 


gaaccgggac 


gtgtacctgc 


agggtcctat 


ctgggccaag 


attcctcaca 


2700 


cggacggcaa 


ctttcatcct 


tcgccgctga 


tgggaggctt 


tggactgaaa 


cacccgcctc 


2760 


ctcagatcct 


gattaagaat 


acacctgttc 


ccgcggatcc 


tccaactacc 


ttcagtcaag 


2820 


ccaagctggc 


gtcgttcatc 


acgcagtaca 


gcaccggaca 


ggtcagcgtg 


gaaattgaat 


2880 


gggagctgca 


gaaagagaac 


agcaagcgct 


ggaacccaga 


gattcagtat 


acttccaact 


2940 


acnacaaa tc 




gactttgctg 


tcaatactga gggtacttat 


tcagagcctc 


3000 


gccccattgg 


cacccgttac 


ctcacccgta acctgtaatt gcctgttaat caataaaccg 


3060 


gttaattcgt 


ttcagttgaa 


ctttggtctc 


tgcgaagggc 


gaattc 




3106 



<210> 35 
<211> 2489 
<212> DNA 

<213> new AAV serotype, clone 42.10 
<400> 35 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 
gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtgaagt 120 
ccgccaaggc cattcatcat ctgctggggc gggctcccga gattgcttgc tcggcctgcg 180 
atctggtcaa cgtggacctg gatgactgtg tttctgagca ataaatgact taaaccaggt 240 
atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 300 
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gagtggtggg 


CL L. t-t y ta. c* t4 >w- iw< 


t crcracr cccc cr 

v* y y W4. y w » w w y 


aaacccaaag 


ccaaccagca 


aaagcaggac 


360 


gacggccggg 




U LL. Uy yLLCLL 


a ay lull l i*y 


tracccttcaa 


ccr tract ccr ac 

w y y u. w o w y u w 


420 


aagggagagc 




y y uay auyuu 


rrprrrrppptpcr 


aocarfiacaa 


crcrcetaccrac 


480 


a age age teg 


anr a rt rr n rt n a 
ay i»»yyyy y° 


raaccccrtac 


rtcaacrtaca 


accacoccaa 

u. Vp* w va w y w v y v* 


cgccgagt tt 


540 


caggagegtc 


L. LLaay aaya 


t accrtctttt 


crcrcrcrcfcaacc 
y y y y j 


"t c cr cr cr c cr a cr c 


agtcttccag 


600 


gecaagaage 


yyy l. l. c tuy a 


«LL uu lu y y L. 


pt*crcrtt era crcr 


U4 y y y U> U U 


gacggctcct 


660 


ggacLa.gcia.yci 


y aui/UQ Lay u 


atccccccrac 


tcctccacgg 


gcat eggcag 


gaaaggccag 


720 


^ t*t r€ "t" a 
Cay CCCgCLd 


aaflay ac*y ul 


p a a ct 1 1 crcr cr 

- cj. ci *w l» tyy y 


cacractcrcrccf 


actcagagtc 


agtgcccgac 


780 


cctcaaccaa 


teggagaace 


uLUUvj Lay y u 


pfPtptrfrrt~p 


t"crcfcratc!tcfcr 
yyy yy 


tacaat ggct 


840 


geaggeggtg 


gcgctccaat 


y y Lay aLaa l. 


aaLy aayy uy 


pprraprrfTsffi" 

Luy auy y uy u. 


crcrcrtaat crcc 


900 


teeggaaatt ggcattgega 


ttccacatcrcr 


ctaaocaaca 

w ^ h m j y *■* *-* 


gagtcat cac 


caccagcacc 


960 


cgcacctggg 


ccctgcccac 


ctacaacaac 


cacctctaca 


agcagatatc 


aagtcagagc 


1020 


ggggctacca 


acgacaacca 


cttcttcggc 


tacagcaccc 


cctgqggcta 


ttttgacttc 


1080 


aacagattcc 


actgccactt 


i—» i*t*»ai»ri*»GLi > fyj w 


crac t crcr c acre 


gactcatcaa 


caacaactgg 


1140 


ggattcegge 


ccagaaagct 


y uy y l- uuaay 


L Ly L. LLQQUa 


tpcacrcrtcaa 

w w w y y w c4 


ggaggt cacg 


1200 


acgaacgacg 


gcgttacgac 


l a uuy Li>aa l. 


aarrt"i"arra 


crcaccrattca 


ggtcttctcg 


1260 


gaeteggagt 


accaactgcc 


crta pcrtp etc 


crcfctct occrc 

y y i» w i#uwu w 


accagggekg 


cctccctccg 


1320 


ttccctgcgg 


acgtgttcat 


crattcctcacr 


tacggatat c 


t gactctaaa 


caaeggcagt 


1380 


cagtctgtgg gacgttcctc 


cttctactcrc 


ctggagtact 


t tccttctca 


aatactgaga 


1440 


aegggcaata 


actttgaatt 


w ti y *w o ui* u.v«» v« 


tttaaacraaa 

y y t*y 


t gcctttcca 


cagcagctat 


1500 


gcgcacagcc 


agagcctgga 


uoyy Luyauy 


aatrccctca 


tccfaccacfta 


cctgtactac 


1560 


ctggcccgga 


cccagagcac 


+* a rrrrrrn'tp p 
lul> Lj yyy 


acaacr cr aaoc 

u w u wi yyy ^* y ^ 


tgcagttcca 


tcaaactaaa 


1620 


cccaacacca 


tggecgagea 


a LLaaay aa<-« 


t*ercrcfccrccccr 


craccctcrtt a 


teggcagcag 


1680 


agactgtcaa 


aaaacataga 


Lay Q.uu QO.L- 


aacacrtaact 


ttgcctggac 


caaooccact 
*"yy yy 


1740 


aaataccatc 


tgaatggtag 


aaau LOaLLa 




rfpfft~arrppa1~ 

y \m* y w u y w w d w 


ggccaccaac 


1800 


aaggacgacg 


aggaccagtt 


l i— u u c ll a l l 


saprrrfarr'T'fTP 
a a v*> y y ay l, y 


■fc cr cr 1 1 1 1 1 crcr 

w y y ^wwwi#yy 


caaaaegggg 


1860 


gctgccaaca 


agacaacget 


crcfa aaaccrtcr 

y y cl ti a ul. y l. y 


ctaatcr acca 


acaaacfaaoa 


gatcaaaacc 


1920 


accaatcccg 


tggctacaga 


acraataccrcrt 

u.y la a d w y y 


gtggtctcca 


gcaacctgca 


ategtctacg 


1980 


gccggacccc 


agacacagac 


1" rr t* p a a p a cj r* 


pacrcrcrfTcrc tc 
y yyy y 


tgcccggcat 




2040 


aacegggacg 


tgtacctgea 


gggtcccatc 


tgggccaaaa 


ttcctcacac 


ggaeggcaac 


2100 


tttcacccgt 


ctcccctgat 


gggcggattt 


ggactcaaac 


acccgcctcc 


tcaaattctc 


2160 


atcaaaaaca 


ccccggtacc 


tgetaatect 


ccagaggtgt 


ttactcctgc 


caagtttgee 


2220 
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1 



tcatttatca cgcagtacag caccggccag gtcagcgtgg agatcgagtg ggaactgcag 2280 

aaagaaaaca gcaaacgctg gaatccagag attcagtaca cctcaaatta tgccaagtct 2340 

aataatgtgg aatttgctgt caacaacgaa ggggtttata ctgagcctcg ccccattggc 2400 

acccgttacc tcacccgtaa cctgtaattg cctgttaatc aataaaccgg ttaattcgtt 2460 

tcagttgaac tttggtcaag ggcgaattc 2489 

<210> 36 

<211> 2495 

<212> DNA 

<213> new AAV serotype, clone 42.3b 

<400> 36 



gaattcgccc 


tttctacggc 


tgcgtcaact 


agaccaatga 


gaactttccc 


ttcaacgatt 


60 


gcgtcgacaa 


gatggtgatc 


tggtgggagg 


agggcaagat 


gacggccaag 


gtcgtggagt 


12 0 


ccgccaaggc 


cattcatcat 


ctgctggggc 


gggctcccga gattgcttgc tcggcctgcg 


180 


atctggtcaa 


cgtggacctg 


gatgactgtg 


tttctgagca 


ataaatgact 


taaaccaggt 


240 


atggctgccg 


atggttatct 


tccagattgg 


ctcgaggaca 


acctctctga 


gggcattcgc 


300 


gagtggtggg 


acttgaaacc 


tggagccccg 


aaacccaaag 


ccaaccagca 


aaagcaggac 


360 


gacggccggg 


gtctggtgct 


tcctggctac 


aagtacctcg 


gacccttcaa 


cggactcgac 


420 


aagggagagc 


cggtcaacga 


ggcagacgcc 


gcggccctcg 


agcacgacaa 


ggcctacgac 


480 


aagcagctcg 


agcaggggga 


caacccgtac 


ctcaagtaca 


accacgccga 


cgccgagttt 


540 


caggagcgtc 


ttcaagaaga 


tacgtctttt 


gggggcaacc 


tcgggcgagc 


agtcttccag 


600 


gccaagaagc 


gggttctcga 


acctctcggt 


ctggttgagg 


aaggcgctaa 


gacggctcct 


660 


ggaaagaaga 


gacccataga 


atcccccgac 


tcctccacgg 


gcatcggcaa 


gaaaggccag 


720 


cagcccgcta 


aaaagaagct 


caactttggg 


cagactggcg 


actcagagtc 


agtgcccgac 


780 


cctcaaccaa 


tcggagaacc 


ccccgcaggc 


ccctctggtc 


tgggatctgg 


tacaatggct 


840 


gcaggcggtg 


gcgctccaat 


ggcagacaat 


aacgaaggcg 


ccgacggagt 


gggtaatgcc 


900 


tccggaaatt 


ggcattgcga 


ttccacatgg ctgggcgaca gagtcatcac 


caccagcacc 


960 


cgcacctggg 


ccctgcccac 


ctacaacaac 


cacctctaca 


agcagatatc 


aagtcagagc 


1020 


ggggctacca 


acgacaacca 


cttcttcggc 


tacagcaccc 


cctggggcta 


ttttgacttc 


1080 


aacagattcc 


actgccactt 


ctcaccacgt 


gactggcagc 


gactcatcaa 


caacaactgg 


1140 


ggattccggc 


ccagaaagct 


gcggttcaag 


ttgttcaaca 


tccaggtcaa 


ggaggtcacg 


1200 


acgaacgacg 


gcgttacgac 


catcgctaat 


aaccttacca 


gcacgattca 


ggtcttctcg 


1260 


gactcggagt 


accaactgcc 


gtacgtcctc 


ggctctgcgc 


accagggctg 


cctccctccg 


1320 


ttccctgcgg 


acgtgttcat 


gattcctcag 


tacggatatc 


tgactctaaa 


caacggcagt 


1380 


cagtctgtgg 


gacgttcctc 


cttctactgc 


ctggagtact 


ttccttctca 


gatgctgaga 


1440 
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acgggcaata 

gcgcacagcc 

ctggcccgga 

cccaacacca 

agactgtcaa 

aaataccatc 

aaggacgacg 

gctgccaaca 

accaatcccg 

gccggacccc 

aaccgggacg 

tttcacccgt 

atcaaaaaca 

tcatttatca 

aaagaaaaca 

aataatgtgg 

acccgttacc 

tcagttgaac 



actttgaatt 

agagcctgga 

cccagagcac 

tggccgagca 

aaaacataga 

tgaatggtag 

aggaccagtt 

agacaacgct 

tggctacaga 

agacacagac 

tgtacctgca 

ctcccctgat 

ccccggtacc 

cgcagtacag 

gcaaacgctg 

aatttgctgt 

tcacccgtaa 

tttggtctct 



cagctacacc 

ccggctgatg 

tacggggtcc 

atcaaagaac 

cagcaacaac 

aaattcatta 

ctttcccatc 

ggaaaacgtg 

acagtacggt 

tgtcaacagc 

gggtcccatc 

gggcggattt 

tgctaatcct 

caccggccag 

gaatccagag 

caacaacgaa 

cctgtaattg 

gcgaagggcg 



tttgaggaag 

aatcccctca 

acaagggagc 

tggctgcccg 

accagtaact 

accaacccgg 

aacggagtgc 

ctaatgacca 

gtggtctcca 

cagggggctc 

tgggccaaaa 

ggactcaaac 

ccagaggtgt 

gtcagcgtgg 

attcagtaca 

ggggtttata 

cctgttaatc 

aattc 



tgcctttcca 

tcgaccagta 

tgcagttcca 

gaccctgtta 

ttgcctggac 

gcgtagccat 

tggtttttgg 

gcgaggagga 

gcaacctgca 

tgcccggcat 

ttcctcacac 

acccgcctcc 

ttactcctgc 

agatcgagtg 

cctcaaatta 

ctgagcctcg 

aataaaccgg 



cagcagctat 

cctgtactac 

tcaggctggg 

tcggcagcag 

cggggccact 

ggccaccaac 

caaaacgggg 

gatcaaaacc 

atcgtctacg 

ggtctggcag 

ggacggcaac 

tcaaattctc 

caagtttgcc 

ggaactgcag 

tgccaagtct 

ccccattggc 

ttaattcgtt 



<210> 37 
<211> 3098 
<212> DNA 

<213> new AAV serotype, clone 42.11 
<400> 37 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 
gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 
ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcttccg 
cccagatcga tcccaccccc gtgatcgtca cttccaacac caacatgtgc gccgtgattg * 
acgggaacag caccaccttc gagcaccagc agccgttaca agaccggatg ttcaaatttg 
aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcaaagagt 
tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 
gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 
gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 
acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 
agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accggagact 



1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2495 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 
ggaaactctg tgccattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 780 
gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 84 0 
ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaagggag agccggtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 108 0 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 

tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagacccat agaatccccc gactcctcca cgggcatcgg caagaaaggc 1320 

cagcagcccg ctaaaaagaa gctcaacttt gggcagactg gcgactcaga gtcagtgccc 1380 

gaccctcaac caatcggaga accccccgca ggcccctctg gtctgggatc tggtacaatg 1440 

gctgcaggcg gtggcgctcc aatggcagac aataacgaag gcgccgacgg agtgggtaat 1500 

gcctccggaa attggcattg cgattccaca tggctgggcg acagagtcat caccaccagc 1560 

acccgcacct gggccctgcc cacctacaac aaccacctct acaagcagat atcaagtcag 1620 

agcggggcta ccaacgacaa ccacttcttc ggctacagca ccccctgggg ctattttgac 1680 

ttcaacagat tccactgcca cttctcacca cgtgactggc agcgactcat caacaacaac 1740 

tggggattcc ggcccagaaa gctgcggttc aagttgttca acatccaggt caaggaggtc 1800 

acgacgaacg acggcgttac gaccatcgct aataacctta ccagcacgat tcaggtcttc 18 60 

tcggactcgg agtaccaact gccgtacgtc ctcggctctg cgcaccaggg ctgcctccct 1920 

ccgttccctg cggacgtgtt catgattcct cagtacggat atctgactct aaacaacggc 1980 

agtcagtctg tgggacgttc ctccttctac tgcctggagt actttccttc tcagatgctg 2040 
agaacgggca ataactttga attcagctac acctttgagg aagtgccttt ccacagcagc 2100 
tatgcgcaca gccagagcct ggaccggctg atgaatcccc tcatcgacca gtacctgtac 2160 
tacctggccc ggacccagag cactacgggg tccacaaggg agctgcagtt ccatcaggct 2220 
gggcccaaca ccatggccga gcaatcaaag aactggctgc ccggaccctg ttatcggcgg 2280 
cagagactgt caaaagacat agacagcaac aacaacagta actttgcctg gaccggggcc 2340 
actaaatacc atctgaatgg tagaaattca ttaaccaacc cgggcgtagc catggccacc 24 00 
aacaaggacg acgaggacca gttctttccc atcaacggag tgctggtttt tggcaaaacg 2460 
ggggctgcca acaagacaac gctggaaaac gtgctaatga ccagcgagga ggagatcaaa 2520 
accaccaatc ccgtggctac agaagaatac ggtgtggtct ccagcaacct gcaatcgtct 2580 
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acggccggac 
cagaaccggg 
aactttcacc 
ctcatcaaaa 
gcctcattta 
cagaaagaga 
tctaataatg 
ggcacccgtt 
gtttcagttg 



cccagacaca 
acgtgtacct 
cgtctcccct 
acaccccggt 
tcacgcagta 
acagcaaacg 
tggaatttgc 
acctcacccg 
aactttggtc 



gactgtcaac 
gcagggtccc 
gatgggcgga 
acctgctaat 
cagcaccggc 
ctggaatcca 
tgtcaacaac 
taacctgtaa 
tctgcgaagg 



agccaggggg ctctgcccgg 
atctgggcca aaattcctca 
tttggactca aacacccgcc 
cctccagagg tgtttactcc 
caggtcagcg tggagatcga 
gagattcagt acacctcaaa 
gaaggggttt atactgagcc 
ttacttgtta atcaataaac 
gcgaattc 



catggtctgg 
cacggacggc 
tcctcaaatt 
tgccaagttt 
gtgggaactg 
ttatgccaag 
tcgccccatt 
cggttgattc 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3098 



<210> 38 
<211> 3276 
<212> DNA 

<213> new AAV serotype, clone 42.6a 
<400> 38 

gaattcgccc ttcgcagaga ccaaagttca actgaaacga attaaccggt ttattgatta 
acaggcaatt acaggttacg ggtgaggtaa cgggtgccaa tggggcgagg ctcagtataa 
accccttcgt tgttgacagc aaattccaca ttattagact tggcataatt tgaggtgtac 
tgaatctctg gattccagcg tttgctgttt tctttctgca gttcccactc gatctccacg 
ctgacctggc cggtgctgta ctgcgtgata aatgaggcaa acttggcagg agtaaacacc 
tctggaggat tagcaggtac cggggtgttt ttgatgagaa tttgaggagg cgggtgtttg 
agtccaaatc cgtccatcag gggagacggg tgaaagttgc cgtccgtgtg aggaattttg 
gcccagatgg gaccctgcag gtacacgtcc cggttctgcc agaccatgcc gggcagagcc 
ccctggctgt tgacagtctg tgtctggggt ccggccgtag acgattgcag gttgctggag 
accacaccgt attcttctgt agccacggga ttggtggttt tgatctcctc ctcgctggtc 
attagcacgt tttccagcgt tgtcttgttg gcagcccccg ttttgccaaa aaccagcact 
ccgttgatgg gaaagaactg gtcctcgtcg tccttgttgg tggccatggc tacgcccggg 
ttggttaatg aatttctacc attcagatgg tatttagtgg ccccggtcca ggcaaagtta 
ctgttgttgt tgctgtctat gttttttgac agtctctgct gccgataaca gggtccgggc 
agccagttct ttgattgctc ggccatggtg ttgggcccag cctgatggaa ctgcagctcc 
cttgtggacc ccgtagtgct ctgggtccgg gccaggtagt acaggtactg gtcgatgagg 
ggattcatca gccggtccag gctctggcta tgcgcatagc tgctgtggaa aggcacttcc 
tcaaaggtgt agctgaattc aaagttattg cccgttctca gcatctgaga aggaaagtac 
tccaggcagt agaaggagga acgtcccaca gactgactgc cgttgtttag agtcagatat 
ccgtactgag gaatcatgaa cacgtccgca gggaacggag ggaggcagcc ctggtgcgca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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gagccgagga 


cgtacggcag 


ttggtactcc 


gagtccgaga agacctgaat 


cgtgctggta 


1260 


aggttat tag 


cgatqgtcgt 


aacgccgtcg 


tccgtcgtga cctccttgac 


ctggatgttg 


1320 


aacaacttga 


accgcagctt 


tetgggcegg 

3 3 3 3 3 


aatccccagt tgttgttgat 


gagtegctge 


1380 


cagtcacgtg 


gtgagaagtg 

3 " ^3 -3 3 


gcagtggaat 


ctgttaaagt caaaataccc 


ccagggggtg 


1440 


ctgtagccga 


agtaggtgtt 


gtcgttgqtg 

3 3 3 3 3 


cttcctcccg atgtcccgtt 


ggagatttgc 


1500 


ttgtagaggt 


era tt at tot a 


ggtggggagg 

33 w 33 33 33 


gcccaggttc gggtgctggt 

3 3 3 3333 3 3 


ggtgatgact 

3 3 3 3 


1560 


ctgtcgccca 


gccatgtgqa 


ategcaatge 


caatttcctg aggaactacc 


cactccgtcg 


1620 


gcgcctt cgt 


tattgtctgc 


cattggagcg 

3 3 3 3 


ccaccgcctg cagecattgt 


accagatccc 


1680 


agaccagagg 


acrcctacagcf 


gggttctccg 


attggttgag ggtegggcac 


tgactctgag 


1740 


tcgccagtct 


geccaaagtt 


gagtctcttt 


ttcgeggget gctggcctgt 


ettgecgatg 


1800 


cccgt agagq 

^ ^ ^3 3 3 


agtctgqaga 


acgctggggt 

3 3 3 3 B 


gatggctcta ccggtctctt 


ctttccagga 


1860 


gccgtcttag 


cgccttcctc 


aaccagaccg 


agaggttcga gaacccgctt 


cttggcctgg 


1920 


aagactgctc 


geccgaggtt 


gcccccaaaa 


gaegtatett cttgaagacg 


ctcctgaaac 


1980 


tcggcgtcgg 


cgtggttgta 

3 3 3 3 


cttgaggtac 


gggttgtccc cctgctcgag 


ctgcttgtcg 


2040 


taggccttgt 


cgtgctcgag 

3 3 3 3 


ggccgcggcg 

3 3 3 3 3 3 


tctgcctcgt tgaccggctc 


tcccttgtcg 


2100 


agtccgttga 


agggtccgag 

3 3 3 3 3 


gtacttgtag 


ccaggaagca ccagaccccg 


gccgtcgtcc 


2160 


tgcttttgct 


qgttqgcttt 

3 3 33 


gggtttcggq 

333 3 3 3 


gctccaggtt tcaagtccca 


ccactcgcga 


2220 


atgccct cag 


agaggttgtc 

3 3 3 3 


ctcgagccaa 


tctggaagat aaccatcggc 


agccatacct 


2280 


aatttaaatc 


atttat tget 


cagaaacaca 


gtcatccagg tccacgttga 


ccagatcgca 


2340 


ggccgagcaa 


geaatctegg 


gagcccgccc 


cagcagatga tgaatggcac 


agagtttccg 


2400 


atacgtcctc 


tttctgacga 


ccggttgaga 

3 3 3 3 


ttctgacacg ceggggaaac 


attctgaaca 


2460 


gtctctggtc 


ccgtgcgtga 

3 3 3 ""3~ 


agcaaatgtt 


gaaattctga ttcattctct 


cgcatgtctt 


2520 


gcagggaaac 

3 3 3 3 


agcatctgaa 


gcatgcccgc 


gtgacgagaa cacttgtttt 


ggtacctgtc 


2580 


ggcaaagtcc 


accggagctc 


cttccgcgtc 


tgaegtcgat ggatgcaaaa 


tgtcgcaaaa 


2640 


gcactcacgt 


gacagctaat 


acaggaccac 


tcccctatga cgtgatttac 


gtcagegcta 


2700 


t gcccgcgt g 


acgagaacat 


ttgttttggt 


acctgtcggc aaagtccacc 


ggagctcctt 


2760 




cq tc gat acra 


tccgcgactg 


aggggcaggc ccgcttgggc 

**3333 wfc *33 w w 3 v w **3 3 3 


tegcttttat 


2820 


ccgcgtcatc 


aggggegggt 

33333 , '"333 


ctcttgttgg 


ctccaccctt tetgaegtag 


aactcatgeg 


2880 


ccacctcggt 


cacgtgatcc 


tgcgcccagc 


ggaagaactc tttgacttcc 


tgctttgtca 


2940 


ccttgccaaa 


gtcatgctcc 


agaeggeggg tgagttcaaa tttgaacatc 


cggtcctgca 


jUUU 


acggctgctg 


gtgctcgaag 


gtggtgctgt 


tcccgtcaat cacggcgcac 


atgttggtgt 


3060 


tggaagtgac 


gatcaegggg 


gtgggatcga 


tctgggcgga agacttgeae 


ttttggtcca 


3120 
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cgcgcacctt gctgccgccg agaatggcct tggcggactc cacgaccttg gccgtcatct 3180 

tgccctcctc ccaccagatc accatcttgt cgacgcaatc gttgaaggga aagttctcat 3240 

tggtccagtt gacgcagccg tagaaagggc gaattc 3276 

<210> 39 
<211> 3084 
<212> DNA 

<213> new AAV serotype, clone 43.1 
<400> 39 

gaattcgccc tttctacggc tgcatcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 180 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaagttcg 300 

aactcacccg ccgtctggag cacgactttg gcaaggtgac caagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 420 

gcggagccag caaaagaccc gcccccgatg acgcggatat aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 



600 
660 



840 
900 
960 



acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 
aaacgtgcga gaaaatgaat cagaatttca acatttgctt cacgcacggg gtcagagact 

gctcagaatg tttccccggt gcatcagaat ctcaaccggt cgtcagaaaa aaaacgtatc 720 

agaaactgtg tgccattcat catctgctgg ggcgggcacc cgagattgct tgctcggcct 780 
gcgatctggt caacgtggac ctggacgact gtgtttctga gcaataaatg acttaaacca 
ggtatggctg ccgatggtta tcttccagat tggcttgagg acaacctctc tgagggcatt 
cgcgagtggt gggacctgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagaccggt agagccatca cctcagcgtt cccccgactc ctccacgggc 1320 

atcggcaaga aaggccacca gcccgcgaga aagagactga actttgggca gactggcgac 1380 

tcggagtcag tccccgaccc tcaaccaatc ggagaaccac cagcaggccc ctctggtctg 1440 

ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 1560 
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gtcatcacca ccagcacccg aacctgggcc ctgcccacct acaacaacca tctctacaag 1620 

caaatctcca acgggacatc gggaggaagc actaacgaca acacctactt tggctacagc 1680 

accccctggg ggtattttga cttcaacaga ttccactgcc acttctcacc acgtgactgg 1740 

cagcgactca tcaacaataa ctggggattc cggcccaaga gactcaactt caagctcttc 1800 

aacatccagg tcaaggaggt cacgcagaat gaaggcacca agaccatcgc caataacctt 1860 

accagcacga ttcaggtgtt tacggactcg gaataccagc tcccgtacgt ccccggctct 1920 

gcgcaccagg gctgcctccc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 1980 

tatctgaccc taaacaatgg cagtcaggct gtgggccgtt cctccttcta ctgcctggaa 2040 

tacttccctt ctcaaatgct gaggacgggc aacaactttg aattcagcta caccttcgag 2100 

gacgtgcctt tccacagcag ctacgcgcac agccagagcc tggaccggct gatgaaccct 2160 

ctcatcgacc agtacctgta ttacttatcc agaactcagt ccacaggagg aactcaaggt 2220 

actcagcaat tgttattttc tcaagccggg cccgcaaaca tgtcggctca ggccaagaac 2280 

tggctacctg gaccgtgtta ccgtcagcaa cgagtttcca cgacactgtc gcaaaacaac 2340 

aacagcaatt ttgcttggac cggtgccacc aagtatcacc tgaatggcag agactccctg 2400 

gttaatcccg gcgttgccat ggctacccac aaggacgacg aggagcgctt cttcccgtca 2460 

agcggagttc taatgtttgg caagcagggg gctggaaaag acaatgtgga ctacagcagc 2520 

gtgatgctca ccagcgaaga agaaattaaa actactaacc cagtggctac agagcagtat 2580 

ggtgtggtgg cagacaacct gcagcagacc aacggagctc ccattgtggg aactgtcaac 2640 

agccaggggg ccttacctgg tatggtctgg caaaaccggg acgtgtacct gcagggcccc 2700 

atctgggcca aaattcctca cacggacggc aactttcatc cttcgccgct gatgggaggc 2760 

tttggactga aacacccgcc tcctcagatc ctggtgaaaa acactcctgt tcctgcggat 2820 

cctccgacca ccttcagcca ggccaagctg gcttctttta tcacgcagta cagcaccgga 2 880 

caggtcagcg tggaaatcga atgggagctg cagaaagaaa acagcaagcg ctggaaccca 2 940 

gagattcagt atacttccaa ctactacaaa tctacaaatg tggactttgc tgtcaatact 3000 

gagggtactt attcagagcc tcgccccatt ggcactcgtt atctcacccg taatctgtaa 3060 

ttgcttgtta atcaataaac cggt 3084 

<210> 40 
<211> 2370 
<212> DNA 

<213> new AAV serotype, clone 43.5 



60 



<400> 40 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 180 
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cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 
acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaagttcg 
aactcacccg ccgtctggag cacgactttg gcaaggtgac caagcaggaa gtcaaagagt 
tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 
gcggagccag caaaagaccc gcccccgatg acgcggatat aagcgagccc aagcgggcct 
gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 
acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagacgctg tttccctgca 
aaacgtgcga gagaatgaat cagaatttca acatttgctt cacgcacggg gtcagagact 
gctcagaatg tttccccggt gcatcagaat ctcaaccggt cgtcagaaaa aaaacgtatc 
agaaactgtg tgccattcat catctgctgg ggcgggcacc cgagattgct tgctcggcct 
gcgatctggt caacgtggac ctggacgact gtgtttctga gcaataaatg acttaaacca 
ggtatggctg ccgatggtta tcttccagat tggcttgagg acaacctctc tgagggcatt 
cgcgagtggt gggacctgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 
gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 
gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 
gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 
tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 
caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 
cctggaaaga agagaccggt agagccatca cctcagcgtt cccccgactc ctccacgggc 
atcggcaaga aaggccacca gcccgcgaga aagagactga actttgggca gactggcgac 
tcggagtcag tccccgaccc tcaaccaatc ggagaaccac cagcaggccc ctctggtctg 
ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 
gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 
gtcatcacca ccagcacccg aacctgggcc ctgcccacct acaacaacca tctctacaag 
caaatctcca acgggacatc gggaggaagc actaacgaca acacctactt tggctacagc 
accccctggg ggtattttga cttcaacaga ttccactgcc acttctcacc acgtgactgg 
cagcgactca tcaacaataa ctggggattc cggcccaaga gactcaactt caagctcttc 
aacatccagg tcaaggaggt cacgcagaat gaaggcacca agaccatcgc caataacctt 
accagcacga ttcaggtgtt tacggactcg gaataccagc tcccgtacgt cctcggctct 
gcgcaccagg gctgcctccc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 
tatctgaccc taaacaatgg cagtcaggct gtgggccgtt cctccttcta ctgcctggaa 
tacttccctt ctcaaatgct gaggacgggc aacaactttg aattcagcta caccttcgag 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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gacgtgcctt tccacagcag ctacgcgcac agccagagcc tggaccggct gatgaaccct 2160 

ctcatcgacc agtacctgta ttacttatcc agaactcagt ccacaggagg aactcaaggt 2220 

actcagcaat tgttattttc tcaagccggg cccgcaaaca tgtyggctca ggccaagaac 2280 

tggctacctg gaccgtgtta ccgtcagcaa cgagtttcca cgacactgtc gcaaaacaac 2340 

aacagcaatt ttgctggacc ggtgccacca 2370 

<210> 41 
<211> 3123 
<212> DNA 

<213> new AAV serotype, clone 43.12 
<400> 41 

gaattcgccc ttggctgcgt caactggacc aatgagaact ttcccttcaa cgattgcgtc 60 

gacaagatgg tgatctggtg ggaggagggc aagatgacgg ccaaggtcgt ggagtccgcc 120 

aaggccattc tcggcggcag caaggtgcgc gtggaccaaa agtgcaagtc gtccgcccag 180 

atcgacccca cccccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg 240 

aacagcacca ccttcgagca ccagcagccg ttgcaggacc ggatgttcaa gttcgaactc 300 

acccgccgtc tggagcaoga ctttggcaag gtgaccaagc aggaagtcaa agagttcttc 360 

cgctgggcgc aggatcacgt gaccgaggtg gcgcatgagt tctacgtcag aaagggcgga 420 

gccagcaaaa gacccgcccc cgatgacgcg gatataagcg agcccaagcg ggcctgcccc 480 

tcagtcgcgg atccatcgac gtcagacgcg gaaggagctc cggtggactt tgccgacagg 540 

taccaaaaca aatgttctcg tcacgcgggc atgctccaga tgctgtttcc ctgcaaaacg 600 

tgcgagagaa tgaatcagaa tttcaacatt tgcttcacgc acggggtcag agactgctca 660 

gaatgtttcc ccggtgcatc agaatctcaa ccggtcgtca gaaaaaaaac gtatcagaaa 720 

ctgtgtgcca ttcatcatct gctggggcgg gcacccgaga ttgcttgctc ggcctgcgat 780 

ctggtcaacg tggacctgga cgactgtgtt tctgagcaat aaatgactta aaccaggtat 840 

ggctgccgat ggttatcttc cagattggct tgaggacaac ctctctgagg gcattcgcga 900 

gtggtgggac ctgaaacctg gagccccgaa acccaaagcc aaccagcaaa agcaggacga 960 

cggccggggt ctggtgcttc ctggctacaa gtacctcgga cccttcaacg gactcgacaa 1020 

gggggagccc gtcaacgcgg cggacgcagc ggccctcgag cacgacaagg cctacgacca 1080 

gcagctcaaa gcgggtgaca atccgtacct gcggtataac cacgccgacg ccgagtttca 1140 

ggagcgtctg caagaagata cgtcttttgg gggcaacctc gggcgagcag tcttccaggc 1200 

caagaagcgg gttctcgaac ctctcggtct ggttgaggaa ggcgctaaga cggctcctgg 12 60 

aaagaagaga ccggtagagc catcacctca gcgttccccc gactcctcca cgggcatcgg 1320 

caagaaaggc caccagcccg cgagaaagag actgaacttt gggcagactg gcgactcgga 1380 

gtcagtcccc gaccctcaac caatcggaga accaccagca ggcccctctg gtctgggatc 1440 
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tggtacaatg gctgcaggcg 


gtggcgctcc 


aatggcagac 


aataacgaag 


gcgccgacgg 


1500 


agtgggtagt tcctcaggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


1560 


caccaccagc acccgaacct 


gggccctgcc 


cacctacaac 


aaccatctct 


acaagcaaat 


1620 


ctccaacggg acatcgggag 


gaagcactaa 


cgacaacacc 


tactttggct 


acagcacccc 


1680 


ctgggggtat tttgacttca 


acagattcca 


ctgccacttc 


tcaccacgtg 


actggcagcg 


1740 


actcatcaac aataactggg 


gattccggcc 


caagagactc 


aacttcaagc 


tcttcaacat 


1800 


ccaggtcaag gaggtcacgc 


agaatgaagg 


caccaagacc 


atcgccaata 


accttaccag 


1860 


cacgattcag gtgtttacgg 


actcggaata 


ccagctcccg 


tacgtcctcg 


gctctgcgca 


1920 


ccagggctgc ctccctccgt 


tcccggcgga 


cgtcttcatg 


attcctcagt 


acgggtatct 


1980 


gaccctaaac aatggcagtc 


aggctgtggg 


ccgttcctcc 


ttctactgcc 


tggaatactt 


2040 


cccttctcaa atgctgagga 


cgggcaacaa 


ctttgaattc 


agctacacct 


tcgaggacgt 


2100 


gcctttccac agcagctacg 


cgcacagcca 


gagcctggac 


cggctgatga 


accctctcat 


2160 


cgaccagtac ctgtattact 


tatccagaac 


tcagtccaca 


ggaggaactc 


aaggtactca 


2220 


gcaattgtta ttttctcaag 


ccgggcccgc 


aaacatgtcg 


gctcaggcca 


agaactggct 


2280 


acctggaccg tgttaccgtc 


agcaacgagt 


ttccacgaca 


ctgtcgcaaa 


acaacaacag 


2340 


caattttgct tggaccggtg 


ccaccaagta 


tcacctgaat 


ggcagagact 


ccctggttaa 


2400 


tcccggcgtt gccatggcta 


cccacaagga 


cgacgaggag 


cgcttcttcc 


cgtcaagcgg 


2460 


agttctaatg tttggcaagc 


agggggctgg 


aaaagacaat 


gtggactaca 


gcagcgtgat 


2520 


gctcaccagc gaagaagaaa 


ttaaaactac 


taacccagtg 


gctacagagc 


agtatggtgt 


2580 


ggtggcagac aacctgcagc 


agaccaacgg 


agctcccatt 


gtgggaactg 


tcaacagcca 


2640 


gggggcctta cctggtatgg 


tctggcaaaa 


ccgggacgtg 


tacctgcagg 


gccccatctg 


2700 


ggccaaaatt cctcacacgg 


acggcaactt 


tcatccttcg 


ccgctgatgg 


gaggctttgg 


2760 


actgaaacac ccgcctcctc 


agatcctggt 


gaaaaacact 


cctgttcctg 


cggatcctcc 


2820 


gaccaccttc agccaggcca 


agctggcttc 


ttttatcacg 


cagtacagca 


ccggacaggt 


2880 


cagcgtggaa atcgaatggg 


agctgcagaa 


agaaaacagc 


aagcgctgga 


acccagagat 


2940 


tcagtatact tccaactact 


acaaatctac 


aaatgtggac 


tttgctgtca 


atactgaggg 


3000 


tacttattca gagcctcgcc 


ccattggcac 


tcgttatctc 


acccgtaatc 


tgtaattgct 


3060 


tgttaatcaa taaaccggtt 


aattcgtttc 


agttgaactt 


tggtctctgc 


gaagggcgaa 


3120 


ttc 










3123 



<210> 42 
<211> 3122 
<212> DNA 

<213> new AAV serotype, clone 43.20 
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<400> 42 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgtgtgga ccaaaagtgc aagtcttccg 180 

cccagatcga tcccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

acgggaacag cgccaccttc gagcaccagc agccgttgca ggaccggatg ttcaaatttg 300 

aactcacccg ccgtctggag catgactttg gcaaggtgac gaagcaggaa gtcaaagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttccac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggatat aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 

ggaaactctg tgcgattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 780 

gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 840 
ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaagcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataatcacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagactggt agagcagtcg ccacaagagc cagactcctc ctcgggcatc 1320 

ggcaagacag gccagcagcc cgctaaaaag agactcaatt ttggtcagac tggcgactca 1380 

gagtcagtcc ccgacccaca acctctcgga gaacctccag cagccccctc aggtctggga 1440 

cctaatacaa tggcttcagg cggtggcgct ccaatggcag acaataacga aggcgccgac 1500 

ggagtgggta attcctcggg aaattggcat tgcgattcca catggctggg ggacagagtc 1560 

atcaccacca gcacccgaac ctgggccctg cccacctaca acaaccacct ctacaagcaa 1620 

atctccaacg gcacctcggg aggaagcacc aacgacaaca cctattttgg ctacagcacc 1680 

ccctgggggt attttgactjt caacagattc cactgtcact tttcaccacg tgactggcaa 174 0 

cgactcatca acaacaattg gggattccgg cccaaaagac tcaacttcaa gctgttcaac 1800 

atccaggtca aggaagtcac gacgaacgaa ggcaccaaga ccatcgccaa taatctcacc 1860 

agcaccgtgc aggtctttac ggactcggag taccagttac cgtacgtgct aggatccgct 1920 



78 



WO 03/042397 



PCT/US02/33629 



caccagggat 

ttaactttaa 

ttcccatcgc 

gtgcctttcc 

atcgaccagt 

actctggcat 

cccggacctt 

aactttgcct 

ccgggcgtgg 

gtcctgattt 

attacagatg 

gtggccatca 

ggggtgattc 

gccaaaattc 

ctgaagcacc 

cttaccttca 

agcgtggaaa 

caatacactt 

gtttatagcg 

gttaatcaat 

tc 



gtctgcctcc 

acaatggaag 

agatgctgag 

acagcagcta 

acctgtacta 

tcagccaagc 

gctaccggca 

ggacgggagc 

caatggcttc 

ttggcaagca 

aggaagaaat 

acaaccaggc 

ccggcatggt 

ctcacacgga 

cgcctcctca 

accaggccaa 

tcgagtggga 

ccaactacta 

agcctcgccc 

aaaccggtta 



gttcccggcg 

ccaagccctg 

aaccggcaac 

cgcgcacagc 

cctggtcaga 

gggtcctagc 

gcagcgcgtc 

tgccaagttt 

ccacaaggat 

aggagccggg 

caaggctacc 

cgccaatacg 

gtggcagaat 

cggcaacttt 

aattctcatc 

gctgaactct 

gctgcagaaa 

caaatctaca 

cattggcacc 

attcgtttca 



gacgtcttca 

ggacgttcct 

aactttcagt 

cagagcctgg 

acgcaaacga 

tcaatggcca 

tccacgacaa 

aagctgaacg 

gacgacgacc 

aacgatggag 

aaccccgtgg 

caggcgcaga 

agagacgtgt 

cacccgtctc 

aagaacacac 

ttcatcacgc 

gaaaacagca 

aatgtggact 

cgttacctca 

gttgaacttt 



cggttcctca 

ccttctactg 

tcagctacac 

acaggctgat 

ctggaactgg 

accaggctag 

ccaaccagaa 

gccgagactc 

gcttcttccc 

tggattacag 

ccacagaaga 

ccggactcgt 

acctgcaggg 

ccctgatggg 

cggttccagc 

agtacagcac 

aacgctggaa 

ttgctgtcaa 

cccgcaacct 

ggtctctgcg 



gtacggctat 

tctggagtat 

cttcgaggac 

gaatcccctc 

agggacgcag 

aaattgggtg 

caacaacagc 

tctaatgaat 

ttcgagcggg 

ccaagtgctg 

atatggagca 

gcacaaccag 

tcccatctgg 

cggctttgga 

ggacccgccg 

cggacaggtc 

tccagagatt 

cacggaagga 

gtaattacat 

aagggcgaat 



<210> 43 
<211> 3117 
<212> DNA 

<213> new AAV serotype, clone 43.21 
<400> 43 

gaattcgccc ttggctgcgt caactggacc aatgagaact ttcccttcaa cgattgcgtc 
gacaagatgg tgatctggtg ggaggagggc aagatgacgg ccaaggtcgt ggagtccgcc 
aaggccattc tcggcggcag caaggtgcgt gtggaccaaa agtgcaagtc ttccgcccag 
atcgatccca cccccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg 
aacagcacca ccttcgagca ccagcagccg ttgcaggacc ggatgttcaa atttgaactc 
acccgccgtc tggagcatga ctttggcaag gtgacgaagc aggaagtcaa agagttcttc 
cgctgggcgc aggatcacgt gaccgaggtg gcgcatgagt tccacgtcag aaagggtgga 



1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3122 



60 
120 
180 
240 
300 
360 
420 
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i 



nccaacaacfa 

vJ ' U-l LA VA VA> V-j LA 


gacccgcccc 


egatgacgeg 


gatataagcg 


agcccaagcg 


ggcctgcccc 


480 


t cagtcgcgg 


atccatcgac 


gtcagacgcg 


gaaggagctc 


cggtggactt 


tgecgacagg 


540 


t a. c caaaaca 


aatgttctcg 


tcacgcgggc 


atgettcaga 


tgctgtttcc 


ctgeaagaca 


600 


fccrrrracracraa 


tgaatcagaa 


tttcaacatt 


tgcttcacgc 


aegggaccag 


agactgttca 


660 


aaatotttcc 


CCCfOCCf tcr tc 

^w\jy vy w v*» 


agaatctcaa 


ccggtcgtca 


gaaagaggac 


gtatcggaaa 


720 


ctctgtgcga 


ttcatcatct 


getggggegg 

3 333333 


gctcccgaga 


ttgettgetc 


ggectgegat 


780 


ctggtcaac g 


tggacctgga 


tgactgtgtt 


tctgagcaat 


aaatgactta 


aaccaggtat 


840 


agctgccgat 


ggttatcttc 


cagattggct 


cgaggacaac 


ctctctgagg 


geattcgega 


900 


atqqtqgqac 


ttgaaacctg 


gagccccgaa 


acccaaagcc 


aaccagcaaa 


agcaggacga 


960 




ctggtgcttc 


ctggctacaa 


gtacctcgga 


cccttcaacg 


gactcgacaa 


1020 


acrcraaaaccc 


ateaacqegq 


cggacgcagc 


ggccctcgag 

3 3 3 3 


cacgacaaag 


cctacgacca 


1080 


crcacrctc a a a 

U tA M w w Vp* LA XA *A 


acaaataaca 
y^yyy I *y aui * 


atccgtacct 


gcggtataat 


cacgccgacg 


ccgagtttca 


1140 


firraaccrtcta 

y y u y ^. y u s 


caaaaaoata 


cgtcttttgg 


gggcaacctc 


gqgcgagcag 

333 3 3 3 


tcttccaggc 


1200 


caaaaaacoa 

^— » a o u u u y v-* y y 


gt tctcgaac 


ctctcggtct 


gattgaggaa 

3 3 ^33 3 


ggegctaaga 

33 3 3 


cggctcctgg 


1260 


a a a. era a. cr acr a 

la la la lj la la y la v-^ la 


ccggt a gage 


agtcgccaca 


agagecagae 


tcctcctcgg 


geateggcaa 


1320 


gaeaggecag 


cagcccgcta 


aaaagagact 


caattttggt 


cagactggcg 


actcagagtc 


1380 


agtccccgac 


ccacaacctc 


teggagaace 


tccagcagcc 


ccctcaggtc 


tgggacctaa 


1440 


tacaatggct 


teaggeggtg 


gcgctccaat ggcagacaat 


aacgaaggcg 


ccgacggagt 


1500 


ggg taat tec 


tegggaaatt 


ggcattgega 


ttccacatgg 


ctgggggaca 

3 3 3 w* 7 


gagtcatcac 


1560 


caccagcacc 


ccr aacctacf cr 

uyuuuui-yyy 


ccctgcccac 


ctacaacaac 


cacctctaca 


agcaaatctc 


1620 


^# ci ci ^ y y w u v-K- 


tccrcrcf acrcf aa 

L*» V# LA 1 VJ U ^ y >»A 


gcaccaacga 


caacacctat 


tttggctaca 


gcaccccctg 


1680 


y y y y u. i» i- 


cract tcaaca 

U V4 ^* w W v LA U. w VA 


gattccactg 


tcacttttca 


ccacgtgact 


ggcaacgact 


1740 


ratcaacaac 

la 1- w CI CA W LA la 


aattggggat 


tccggcccaa 


aagactcaac 


ttcaagctgt 


tcaacatcca 


1800 


crcrtcaacrcraa 


gt cacgacga 


acgaaggcac 


caagaccatc 


gecaataate 


tcaccagcac 


1860 


u u l u w lj y l^ 


tttaeggact 


eggagtacca 


gttacegtae 


gtgetaggat 

3 3 33 


ccgctcacca 


1920 


cr crcr at erfceter 

U u u la u> lj w w xa 


cctccgttcc 


eggeggaegt 


cttcatggtt 


cctcagtacg 


gctatttaac 


1980 


tttaaacaat 


acr aaqccaacf 


ccctgggacg 


ttcctccttc 


tactgtctgg 


agtatttccc 


2040 


atcacaaatcf 


ctgagaaccg 


gcaacaactt 


tcagttcagc 


tacaccttcg 


aggaegtgee 


2100 


ttt ccacagc 


agctacgcgc 


acagecagag 


cctggacagg 


ctgatgaatc 


ccctcatcga 


2160 


ccagtacctg 


tactacctgg 


teagaacgea 


aacgactgga 


actggaggga 


cgcagactct 


zzzv 


ggcattcagc 


caagegggtc 


ctagctcaat 


ggccaaccag 


gctagaaatt 


gggtgcccgg 


2280 


accttgctac 


cggcagcagc 


gcgtctccac 


gacaaccaac 


cagagcaaca 


acagcaactt 


2340 
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tgcctggacg 

cgtggcaatg 

gatttttggc 

agatgaggaa 

catcaacaac 

gattcccggc 

aattcctcac 

gcacccgcct 

cttcaaccag 

ggaaatcgag 

cacttccaac 

tagcgagcct 

tcaataaacc 



ggagctgcca 

gcttcccaca 

aagcaaggag 

gaaatcaagg 

caggccgcca 

atggtgtggc 

acggacggca 

cctcaaattc 

gccaagctga 

tgggagctgc 

tactacaaat 

cgccccattg 

ggttaattcg 



agtttaagct 

aggatgacga 

ccgggaacga 

ctaccaaccc 

atacgcaggc 

agaatagaga 

actttcaccc 

tcatcaagaa 

actctttcat 

agaaagaaaa 

ctacaaatgt 

gcacccgtta 

tttcagttga 



gaacggccga 

cgaccgcttc 

tggagtggat 

cgtggccaca 

gcagaccgga 

cgtgtacctg 

gtctcccctg 

cacaccggtt 

cacgcagtac 

cagcaaacgc 

ggactttgct 

cctcacccgc 

actttggtct 



gactctctaa 

ttcccttcga 

tacagccaag 

gaagaatatg 

ctcgtgcaca 

cagggtccca 

atgggcggct 

ccagcggacc 

agcaccggac 

tggaatccag 

gtcaacacgg 

aacctgtaat 

ctgcgaaggg 



tgaatccggg 

gcggggtcct 

tgctgattac 

gagcagtggc 

accagggggt 

tctgggccaa 

ttggactgaa 

cgccgcttac 

aggtcagcgt 

agattcaata 

aaggagttta 

tacatgttaa 

cgaattc 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3117 



<210> 44 
<211> 3121 
<212> DNA 

<213> new AAV serotype, clone 43.23 
<400> 44 

gaattcgccc ttctacggct gcgtcaactg gaccaatgag aactttccct tcaacgattg 
cgtcgacaag atggtgatct ggtgggagga gggcaagatg acggccaagg tcgtggagtc 
cgccaaggcc attctcggcg gcagcaaggt gcgtgtggac caaaagtgca agtcttccgc 
ccagatcgat cccacccccg tgatcgtcac ctccaacacc aacatgtgcg ccgtgattga 
cgggaacagc accaccttcg agcaccagca gccgttgcag gaccggatgt tcaaatttga 
actcacccgc cgtctggagc atgactttgg caaggtgacg aagcaggaag tcaaagagtt 
cttccgctgg gcgcaggatc acgtgaccga ggtggcgcat gagttccacg tcagaaaggg 
tggcgccaac aagagacccg cccccgatga cgcggatata agcgagccca agcgggcctg 
cccctcagtc gcggatccat cgacgtcaga cgcggaagga gctccggtgg actttgccga 
caggtaccaa aacaaatgtt ctcgtcacgc gggcatgctt cagatgctgt ttccctgcaa 
gacatgcgag agaatgaatc agaatttcaa catttgcttc acgcacggga ccagagactg 
ttcagaatgt ttccccggcg tgtcagaatc tcaaccggtc gtcagaaaga ggacgtatcg 
gaaactctgt gcgattcatc atctgctggg gcgggctccc gagattgctt gctcggcctg 
cgatctggtc aacgtggacc tggatgactg tgtttctgag caataaatga cttaaaccag 
gtatggctgc cgatggttat cttccagatt ggctcgagga caacctctct gagggcattc 
gcgagtggtg ggacttgaaa cctggagccc cgaaacccaa agccaaccag caaaagcagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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acgacggccg gggtctggtg cttcctggct acaagtacct cggacccttc aacggactcg 1020 

acaaggggga gcccgtcaac gcggcggacg cagcggccct cgagcacgac aaagcctacg 1080 

accagcagct caaagcgggt gacaatccgt acctgcggta taatcacgcc gacgccgagt 1140 

ttcaggagc'g tctgcaagaa gatacgtcct ttgggggcaa cctcgggcga gcagtcttcc 1200 

aggccaagaa gcgggttctc gaacctctcg gtctggttga ggaaggcgct aagacggctc 1260 

ctggaaagaa gagaccggta gagcagtcgc cacaagagcc agactcctcc tcgggcatcg 1320 

gcaagacagg ccagcagccc gctaaaaaga gactcaattt tggtcagact ggcgactcag 1380 

agtcagtccc cgacccacaa cctctcggag aacctccagc agccccctca ggtctgggac 1440 

ctaatacaat ggcttcaggc ggtggcgctc caatggcaga caataacgaa ggcgccgacg 1500 

gagtgggtaa ttcctcggga aattggcatt gcgattccac atggctgggg gacagagtca 1560 

tcaccaccag cacccgaacc tgggccctgc ccacctacaa caaccacctc tacaagcaaa 1620 

tctccaacgg cacctcggga ggaagcacca acgacaacac ctattttggc tacagcaccc 1680 

cctgggggta ttttgacttc aacagattcc actgtcactt ttcaccacgt gactggcaac 1740 

gactcatcaa caacaattgg ggattccggc ccaaaagact caacttcaag ctgttcaaca 1800 

tccaggtcaa ggaagtca.cg acgaacgaag gcaccaagac catcgccaat aatctcacca 1860 

gcaccgtgca ggtctttacg gacttggagt accagttacc gtacgtgcta ggatccgctc 1920 

accagggatg tctgcctccg ttcccggcgg acgtcttcat ggttcctcag tacggctatt 1980 

taactttaaa caatggaagc caagccctgg gacgttcctc cttctactgt ctggagtatt 2040 

tcccatcgca gatgccgaga accggcaaca actttcagtt cagctacacc ttcgaggacg 2100 

tgcctttcca cagcagctac gcgcacagcc agagcctgga caggctgatg aatcccctca 2160 

tcgaccagta cctgtactac ctggtcagaa cgcaaacgac tggaactgga gggacgcaga 2220 

ctctggcatt cagccaagcg ggtcctagct caatggccaa ccaggctaga aattgggtgc 2280 

ccggaccttg ctaccggcag cagcgcgtct ccacgacaac caaccagaac aacaacagca 2340 

actttgcctg gacgggagct gccaagttta agctgaacgg ccgagactct ctaatgaatc 2400 

cgggcgtggc aatggcttcc cacaaggatg acgacgaccg cttcttccct tcgagcgggg 24 60 

tcctgatttt tggcaagcaa ggagccggga acgatggagt ggattacagc caagtgctga 2520 

ttacagatga ggaagaaatc aaggctacca accccgtggc cacagaagaa tatggagcag 2580 

tggccatcaa caaccaggcc gccaatacgc aggcgcagac cggactcgtg cacaaccagg 2 64 0 

gggtgattcc cggcatggtg tggcagaata gagacgtgta cctgcagggt cccatctggg 2700 

ccaaaattcc tcacacggac ggcaactttc acccgtctcc cctgatgggc ggctttggac 2760 

tgaagcaccc gcctcctcaa attctcatca agaacacacc ggttccagcg gacccgccgc 2 82 0 

ttaccttcaa ccaggccaag ctgaactctt tcatcacgca gtacagcacc ggacaggtca 2880 
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gcgtggaaat cgagtgggag ctgcagaaag aaaacagcaa acgctggaat ccagagattc 2940 



3000 
3060 



aatacacttc caactactac aaatctacaa atgtggactt tgctgtcaac acggaaggag 
tttatagcga gcctcgcccc attggcaccc gttacctcac ccgcaacctg taattacatg 
ttaatcaata aaccggttaa ttcgtttcag ttgaactttg gtctctgcga agggcgaatt 3120 

3121 

<210> 45 
<211> 3122 
<212> DNA 

<213> new AAV serotype, clone 43.25 



60 



<400> 45 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaagg tgcgtgtgga ccaaaagtgc aagtcttccg 180 

cccagatcga tcccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaaatttg 300 

aactcacccg ccgtctggag catgactttg gcaaggtgac gaagcaggaa gtcaaagggt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttccac gtgcgagccc 420 

aagcgggcct gcccctcagt cgcggatcca tcgacgtcag accagaaagg gtggagccaa 480 

caagagaccc gcccccgatg acgcggatat aagcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 720 

ggaaactctg tgcgattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 780 

gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 840 
ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaagcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataatcacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagaccggt agagcagtcg ccacaagagc cagactcctc ctcgggcatc 1320 

ggcaagacag gccagcagcc cgctaaaaag agactcaatt ttggtcagac tggcgactca 1380 

gagtcagtcc ccgacccaca acctctcgga gaacctccag cagccccctc aggtctggga 1440 



900 
960 
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r*r , t"aai~af"3a 




c cr a t cr cr c a c t 


w W \JL UU U w la y 


acaataacga 


aggcgccgac 


1500 




a l i,ol> i.uy yy 


aaa t tggcat 


tgegat tcca 


catggctggg 


ggacagagtc 


1560 


aVraccacca 


y v_ r a w v. w y u. w 


c tgggccctg 


cccacct aca 


acaaccacct 


ctacaagcaa 


1620 


a Lt u o i_ a. a.v_» y 


rrr'^r , r , t"r , rrfTn' 
y LQL.L. Luy y y 


aoaaaticacc 

la y y la la y w la l^ l-» 


aacgacaaca 


cctattttgg 


ctacagcacc 


1680 


r , r , r ,, t*rrrrrTfTff't 


a i» li> uy l-\» 


caacagattc 


cactgtcact 


tttcaccacg 


tgactggcaa 


1740 


O y CL w w w CL w wl 


acaacaattcr 

U w tL W LA CL S* v Vj 


craa att cccra 


cccaaaagac 


tcaacttcaa 


gctgttcaac 


1800 


a 1~ f r* a n rft*ca 


aggaagtc ac 


gacgaacgaa 


ggcaccaaga 


ccatcgccaa 


taatctcacc 


1860 


aercacccrfccic 


aggt cttt ac 


qqactcqqaa 


taccagttac 


cgtacgtgct 


aggatccget 


1920 


caecacrcfcrat 


gtctgcctcc 


crtt ccccr aca 


gaegtcttea 


tggttcctca 


gtaeggctat 


1980 


t*taart"t"taa 


acaatggaag 


ccaagccctg 


ggacgttcct 


ccttctactg 


tctggagtat 


2040 




acratcrctcracr 

ay a i.y uy 


aaccggcaac 


aactttcagt 


tcagctacac 


cttcgaggac 


2100 


crtcrcc tttcc 

y y w w w w 


ac age age ta 


cgcgcacagc 


caqaqcctgg 


acaggctgat 


gaatcccctc 


2160 


atccraccacrt 


acctgtacta 


cctggtcaga 


aegcaaaega 


ctggaactgg 


agggaegcag 


2220 


ar , +*pt*CTcrcat 

uU L-u i»y y ti Ih> 


tcacrccaaac 


aacrtcctaac 

y y y l* w w w y 


teaatggeca 


accaggctag 


aaattgggtg 

7 7 7 7 


2280 


r c c crcf a c c 1 1 

v-* u y h V4» w l# o 


gc taceggea 


gcagcgcgtc 


tccacgacaa 


ccaaccagaa 


caacaacagc 


2340 


aactt'tcrcct 


ggaegggage 


tgccaagttt 


aagctgaacg 


gccgagactc 


tctaatgaat 


2400 


cccrcffjccftcrcr 
y y ? 3 L y y 


caatggcttc 


ccacaaggat 


gacgacgacc 


gcttcttccc 


ttcgagcggg 


2460 


rrtrctciattt 


ttggcaagca 


aggagceggg 


aacaatcrqacf 


tggattacag 


ccaagtgctg 


2520 


at*t*acacfatcf 

CL l« l-Uw ol y CL y 


aggaagaaat caaggctacc 


aaccccgtgg 


ccacagaaga 


atatggagca 


2580 


rrtrrticcatca 
y y y 


acaaccaggc 


cgccaatacg 


caaacqcaqa 

vvA y a w a j 


ccggactcgt 


gcacaaccag 


2640 


rrcrcrcffc crattc 
w yy y y 


ccggcatggt 


gtggcagaat 


acraaacqtqt 


acctgeaggg 


tcccatctgg 


2700 


nrraaaa t"t~ r" 

yi^WClClClClL. 


ctcacacgga 


eggcaacttt 


cacccgtctc 


ccctaataqa 


cggctttgga 


2760 


C UyddyUaixL, 


cgcctcctca 


aattctcatc 


aacraacacac 

U. CL y WL U w WA w U. W 


cggttccagc 


crcraccccrccg 

77 7 7 


2820 




accaggccaa 


gctgaactct 


"H~ ca tea cere 


acrt acacfcac 


eggacaggtc 


2880 


ay uy ty y eta a 


tcgagtggga getgeagaaa 


gaaaacagca 


aacgctggaa 


tccagagatt 


2940 


Uaa Latdu w. u 


ccaactacta 


caaatctaca 


aatgtggact 


ttgctgtcaa 


caccraagqgg 

7 3 7 7 77 


3000 


gtttatagcg 


agcctcgccc 


cattggcacc 


cgttacctca 


cccgcaacct 


gtaattacat 


3060 


gttaatcaat 


aaaccggtta 


attegtttea 


gttgaacttt 


ggtctctgcg 


aagggcgaat 


3120 


tc 












3122 



<210> 46 

<211> 3128 

<212> DNA 

<213> new AAV serotype, clone 44.1 
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<400> 46 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatgttgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaaag tgcgcgtgga ccaaaagtgc aagccgtccg 180 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttgcg ggaccggatg ttcaagtttg 300 

aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcagagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca cgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

aaacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaaa aagacgtatc 720 

ggaaactctg tgcgattcat catctgctgg ggcgggcacc cgagattgct tgctcggcct 780 

gcgatctggt caacgtggac ctagatgact gtgtttctga gcaataaatg acttaaacca 840 

ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 900 

cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 960 

gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 1020 

gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 1080 

gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 1140 

tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 1200 

caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 1260 

cctggaaaga agagaccggt agagccatca ccccagcgtt ctccagactc ctctacgggc 1320 

atcggcaaga aaggccagca gcccgcgaaa aagagactca actttgggca gactggcgac 1380 

tcagagtcag tgcccgaccc tcaaccaatc ggagaacccc ccgcaggccc ctctggtctg 1440 

ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 1560 

gtcatcacca ccagcacccg aacctgggcc ctccccacct acaacaacca cctctacaag 1620 

caaatctcca acgggacttc gggaggaagc accaacgaca acacctactt cggctacagc 1680 

accccctggg ggtattttga ctttaacaga ttccactgcc acttctcacc acgtgactgg 1740 

cagcgactca tcaacaacaa ctggggattc cggcccaaga gactcaactt caagctcttc 1800 

aacatccagg tcaaggaggt cacgcagaat gaaggcacca agaccatcgc caataacctt 1860 

accagcacga ttcaggtctt tacggactcg gaataccagc tcccgtacgt cctcggctct 1920 
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gcgcaccagg gctgcctgcc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 1980 

tacctgactc tgaacaatgg cagtcaggcc gtgggccgtt cctccttcta ctgcctggag 2040 

tactttcctt ctcaaatgct gagaacgggc aacaactttg agttcagcta ccagtttgag 2100 

gacgtgcctt ttcacagcag ctacgcgcac agccaaagcc tggaccggct gatgaacccc 2160 

ctcatcgacc agtacctgta ctacctgtct cggactcagt ccacgggagg taccgcagga 2220 

actcagcagt tgctattttc tcaggccggg cctaataaca tgtcggctca ggccaaaaac 2280 

tggctacccg ggccctgcta ccggcagcaa cgcgtctcca cgacactgtc gcaaaataac 2340 

aacagcaact gtaaatcccg gtgtcgctat ggcaacccac aaggacgacg aagagcgatt 2400 

ttgcctggac cggtgccacc aagtatcatc tgaatggcag agactctctg ttttccgtcc 2460 

agcggagtct taatgtttgg gaaacaggga gctggaaaag acaacgtgga ctatagcagc 2520 

gttatgctaa ccagtgagga agaaattaaa accaccaacc cagtggccac ggaacagtac 2580 

ggcgtggtgg ccgataacct gcaacagcaa aacgccgctc ctattgtagg ggccgtcaac 2640 

agtcaaggag ccttacctgg catggtctgg cagaaccggg acgtgtacct gcagggtcct 2700 

atctgggcca agattcctca cacggacgga aactttcatc cctcgccgct gatgggaggc 2760 

tttggactga aacacccgcc tcctcagatc ctgattaaga atacacctgt tcccgcggat 2820 

cctccaacta ccttcagtca agctaagctg gcgtcgttca tcacgcagta cagcaccgga 2880 

caggtcagcg tggaaattga atgggagctg cagaaagaaa acagcaaacg ctggaaccca 2 940 
gagattcaat acacttccaa ctactacaaa tctacaaatg tggacttcgc tgttaacaca 
gatggcactt attctgagcc tcgccccatt ggcacccgtt acctcacccg taatctgtaa 

ttgctcgtta atcaataaac cggttgattc gtttcagttg aactttggtc tctgcgaagg 3120 

gcgaattc 3128 

<210> 47 
<211> 3128 
<212> DNA 

<213> new AAV serotype, clone 44.5 
<400> 47 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaaag tgcgcgtgga ccaaaagtgc aagtcgtccg 180 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 
acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaagtttg 
aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcagagagt 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca cgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 



3000 
3060 



300 
360 
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gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 
acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 
aaacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 
gttcagaatg tttccccggc gtgtcagaat ctcaaccggt tgtcagaaaa aagacgtatc 
ggaaactctg tgcgattcat catctgctgg ggcgggcacc cgagattgct tgctcggcct 
gcgatctggt caacgtggac ctagatgact gtgtttctga gcaataaatg acttaaacca 
ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatt 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 
gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 
gacaaggggg agcccgtcaa cgcggcggac gcagcggccc tcgagcacga caaggcctac 
gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc cgacgccgag 
tttcaggagc gtctgcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 
caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 
cctggaaaga agagaccggt agagccatca ccccagcgtt ctccagactc ctctacgggc 
atcggcaaga aaggccagca gcccgcgaaa aagagactca actttgggca gactggcgac 
tcagagtcag tgcccgaccc tcaaccaatc ggagaacccc ccgcaggccc ctctggtctg 
ggatctggta caatggctgc aggcggtggc gctccaatgg cagacaataa cgaaggcgcc 
gacggagtgg gtagttcctc aggaaattgg cattgcgatt ccacatggct gggcgacaga 
gtcatcacca ccagcacccg aacctgggcc ctccccacct acaacaacca cctctacaag 
caaatctcca acgggacttc gggaggaagc accaacgaca acacctactt cggctacagc 
accccctggg ggtattttga ctttaacaga ttccactgcc acttctcacc acgtgactgg 
cagcgactca tcaacaacaa ctggggattc cggcccaaga gacccaactt caagctcttc 
aacatccagg tcaaggaggt cacgcagaat gaaggcacca agaccatcgc caataacctt 
accagcacga ttcaggtctt tacggactcg gaataccagc tcccgtacgt cctcggctct 
gcgcaccagg gctgcctgcc tccgttcccg gcggacgtct tcatgattcc tcagtacggg 
tacctgactc tgaacaatgg cagtcaggcc gtgggccgtt cctccttcta ctgcctggag 
tactttcctt ctcaaatgct gagaacgggc aacaactttg agttcagcta ccagtttgag 
gacgtgcctt ttcacagcag ctacgcgcac agccaaagcc tggaccggct gatgaacccc 
ctcatcgacc agtacctgta ctacctgtct cggactcagt ccacgggagg taccgcagga 
actcagcagt tgctattttc tcaggccggg cctaataaca tgtcggctca ggccaaaaac 
tggctacccg ggccctgcta ccggcagcaa cgcgtctcca cgacactgtc gcaaaataac 
aacagcaact ttgcctggac cggtgccacc aagtatcatc tgaatggcag agactctctg 



540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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gt gtegctat 


ggcaacccac 


aacicracgacg 


aagagegatt 


ttttccgtcc 


2460 


a rrr rrrr a rr t r" t 
ayuyy ay l. 


taatatttcra 


aaaacacrcrcra 


actacraaaag 


acaacgtgga 


ctatagcagc 


2520 


rrtfatcfctaa 

y LLaty w i» C*. d 


ccaataacf oa 


agaaatt aaa 


accaccaacc 


cagtggccac 


agaacagtac 


2580 


ggcgugg i-yy 


cc gataacct 


gcaacagcaa 


aacgccgctc 


ctattgtagg 


ggccgtcaac 


2640 


a. y u u day gay 


cc ttacct ci ci 


catggtctgg 


caaaaccacrg 


acgtgtacct 


gcagggtcct 


2700 


ci lyyy uua 


ag at tec tea 


caeggaegga 


aactttcatc 


cctcgccgct 


gatgggaggc 


2760 


t t"t nrcrac to a 

o o y y y 


aacacccgcc 


tcctcagatc 


ctgattaaga 


atacacctgt 


tcccgcggat 


2820 


rrirraacta 


cctt cagt ca 


agctaagctg 


gcgtcgttca 


teaegcagta 


cagcaccgga 


2880 


r* a rf rrt ca esc a 

u u u l y y 


tggaaatt ga 


atcracragctcf 


cagaaagaaa 


acagcaaacg 


ctggaaccca 


2940 


(T3«aH"^ a a 4" 

y agaLLLdat 


d. L, d l— L L U UQ 


ctactacaaa 

Nfejr CI W W C4 W UUU 


tctacaaatg 


t ggactttgc 


tgttaacaca 


3000 


gatggcactt 


attctgagee 


tcgccccatt 


ggcacccgtt 


acctcacccg 


taa tctgtaa 


JUbU 


ttgcttgtta 


atcaataaac 


eggttgatte 


gtttcagttg 


aactttggtc 


tetgegaagg 


3120 


gegaatte 












3128 



<210> 48 

<211> 1933 

<212> DNA 

<213> new AAV serotype, clone 223.10 
<220> 

<221> misc_f eature 

<222> (1302) . . (1302) 



<223> can 


be a, c, g 


or t 










<400> 48 
caaggcctac 


gaccagcagc 


teaaageggg 


tgacaatccg 


tacctgeggt 


ataaccacgc 


60 


cgacgccgag 


tttcaggagc 


gtcttcaaga 


agataegtet 


tttgggggca 


acctcgggcg 


120 


agcagtcttc 


caggecaaaa 


agegggttet 


cgaacctctt 


ggtctggttg 


agacgccagc 


180 


taagaeggea 


cctggaaaga 


agegaceggt agactcgcca 


gactccacct 


egggcategg 


240 


caagaaaggc 


cagcagcccg 


cgaaaaagag 


actcaacttt 


gggcagactg 


gcgactcaga 


300 


gtcagtcccc 


gaccctcaac 


caateggaga 


accaccagca 


ggcccctctg 


gtctgggatc 


360 


tggtacaatg 


getgeaggeg 


gtggcgcacc 


aatggctgac 


aataacgagg 


gcgccgacgg 


420 


agtgggtaat 


gectcaggaa 


attggcattg 


cgattccaca 


tggctgggcg 


acagagtcat 


480 


caccaccagc 


acccgaacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcaaat 


540 


ctccagtcag tcagcaggga gcaccaacga taaegtctat tteggctaca gcaccccctg 


600 


ggggtatttt 


gacttcaaca 


gattccattg 


ccacttctca 


ccacgtgact 


ggcagegact 


660 


tatcaacaac 


aactggggat 


tccggcccaa 


gaagctcaac 


ttcaagctct 


tcaacatcca 


720 


ggtcaaggag 


gtcacgacga 


atgacggtgt 


cacaaccatc 


gctaataacc 


ttaccagcac 


780 
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ggttcaggtc ttttcggact cggaatatca actgccgtac gtcctcggct ccgcgcacca 840 

gggctgcctg cctccgttcc cggcagacgt gttcatgatt ccgcagtacg gatacctgac 900 

tctgaacaat ggcagccaat cggtaggccg ttcctccttc tactgcctgg agtactttcc 960 

ttctcagatg ctgagaacgg gcaacaactt cacctttagc tacaccttcg aggacgtgcc 1020 

tttccacagc agctacgcgc acagccagag tctggaccgg ctgatgaatc ccctcatcga 1080 

ccagtacctg tactacttgg ccagaacaca gagcaacgca ggaggtactg ctggcaatcg 1140 

ggaactgcag ttttatcagg gcggacctac caccatggcc gaacaagcaa agaactggct 1200 

gcccggacct tgcttccggc aacagagagt atccaagacg ctggatcaaa ataacaacag 1260 

caactttgcc tggactggtg ccacaaaata ccatttaaat gnaagaaatt cattggttaa 1320 

tcccggtgtc gccatggcaa cccacaagga cgacgaggaa cgcttcttcc cttcgagcgg 1380 

agttctaatt tttggcaaaa ctggagcagc taataaaact acattagaaa acgtgctcat 1440 

gacaaatgaa gaagaaattc gtcctaccaa cccggtagct accgaggaat acgggattgt 1500 

aagcagcaac ttgcaggcgg ctagcaccgc agcccagaca caagttgtta acaaccaggg 1560 

agccttacct ggcatggtct ggcagaaccg ggacgtgtac ctgcaaggtc ccatttgggc 1620 

caagattcct cacacggacg gcaactttca cccgtctcct ctaatgggtg gctttggact 1680 

gaaacacccg cctccccaga tcctgatcaa aaacacaccg gtacctgcta atcctccaga 1740 

agtgtttact cctgccaagt ttgcttcctt catcacgcag tacagcaccg ggcaagtcag 1800 

cgttgagatc gagtgggagc tgcagaaaga gaacagcaag cgctggaacc cagagattca 1860 

gtacacctcc aactttgaca aacagactgg agtggacttt gctgttgaca gccagggtgt 1920 

ttactctgag cct 1933 

<210> 49 
<211> 1933 
<212> DNA 

<213> new AAV serotype, clone 223.2 
<400> 49 

caaggcctac gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc 60 

cgacgccgag tttcaggagt gtcttcaaga agatacgtct tttgggggca acctcgggcg 120 

agcagtcttc caggccaaaa agcgggttct cgaacctctt ggtctggttg agacgccagc 180 

taagacggca cctggaaaga agcgaccggt agactcgcca gactccacct cgggcatcgg 240 

caagaaaggc cagcagcccg cgaaaaagag actcaacttt gggcagactg gcgactcaga 300 

gtcagtcccc gaccctcaac caatcggaga accaccagca ggcccctctg gtctgggatc 360 

tggtacaatg gttgcaggcg gtggcgcacc aatggctgac aataacgagg gcgccgacgg 420 

agtgggtaat gcctcaggaa attggcattg cgattccaca tggctgggcg acagagtcat 480 

caccaccagc acccgaacct gggccctgcc cacctacaac aaccacctct acaagcaaat 540 



89 



WO 03/042397 



PCT/US02/33629 



ctccagtcag tcagcaggga gcaccaacga taacgtctat ttcggctaca gcaccccctg 600 

ggggtatttt gacttcaaca gattccattg ccacttctca ccacgtgact ggcagcgact 660 

tatcaacaac aactggggat tccggcccaa gaagctcaac ttcaagctct tcaacatcca 720 

ggtcaaggag gtcacgacga atgacggtgt cacaaccatc gctaataacc ttaccagcac 780 

ggttcaggtc ttttcggact cggaatatca actgccgtac gtcctcggct ccgcgcacca 840 

gggctgcctg cctccgttcc cggcagacgt gttcatgatt ccgcagtacg gatacctgac 900 

tctgaacaat ggcagccaat cggtaggccg ttcctccttc tactgcctgg agtactttcc 960 

ttctcagatg ctgagaacgg gcaacaactt cacctttagc tacaccttcg aggacgtgcc 1020 

tttccacagc agctacgcgc acagccagag tctggaccgg ctgatgaatc ccctcatcga 1080 

ccagtacctg tactacttgg ccagaacaca gagcaacgca ggaggtactg ctggcaatcg 1140 

ggaactgcag ttttatcagg gcggacctac caccatggcc gaacaagcaa agaactggct 1200 

gcccggacct tgcttccggc aacagagagt atccaagacg ctggatcaaa ataacaacag 1260 

caactttgcc tggactggtg ccacaaaata ccatttaaat ggaagaaatt cattggttaa 1320 

tcccggtgtc gccatggcaa cccacaagga cgacgaggaa cgcttctccc cttcgagcgg 1380 

agttctaatt tttggcaaaa ctggagcagc taataaaact acattagaaa acgtgctcat 1440 

gacaaatgaa gaagaaattc gtcctaccaa cccggtagct accgaggaat acgggattgt 1500 

aagcagcaac ttgcaggcgg ctagcaccgc agcccagaca caagttgtta acaaccaggg 1560 

agccttacct ggcatggtct ggcagaaccg ggacgtgtac ctgcaaggtc ccatttgggc 1620 

caagattcct cacacggacg gcaactttca cccgtctcct ctaatgggtg gctttggact 1680 

gaaacacccg cctccccaga tcctgatcaa aaacacgccg gtacctgcta atcctccaga 1740 

agtgtttact cctgccaagt ttgcttcctt catcacgcag tacagcaccg ggcaagtcag 1800 

cgttgagatc gagtgggagc tgcagaaaga gaacagcaag cgctggaacc cagagattca 1860 

gtacacctcc aactttgaca aacagactgg agtggacttt gctgttgaca gccagggtgt 1920 

ttactctgag cct 193 3 

<210> 50 
<211> 1933 
<212> DNA 

<213> new AAV serotype, clone 223.4 
<400> 50 

caaggcctac gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc 60 

cgacgccgag tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg 120 

agcagtcttc caggccaaaa agcgggttct cgaacctctt ggtctggttg agacgccagc 180 

taagacggca cctggaaaga agcgaccggt agactcgcca gactccacct cgggcatcgg 240 

caagaaaggc cagcagcccg cgaaaaagag actcaacttt gggcagactg gcgactcaga 300 
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gccagtcccc 


gaccctcaac 


caateggaga 


accaccagca 


ggcccctctg 


gtctgggatc 


360 


t ggt acaatg 


actacaggcg 


gtggcgcacc 


aatggctgac 


aataacgagg 


gcgccgacgg 


420 


acrtcrcf otaat 


gcctcaggaa 


attggcattg 


cgattccaca 


cggctgggcg 


acagagtcat 


480 


rarraccaoc 


acccgaacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcaaat 


540 


r 1" r* c a cr t c a cr 


tcaacaacraa 


gcaccaacga 


taaegtctat 


tteggctaca 


gcaccccctg 


600 


aaacrtatttt 


gacttcaaca 


gattccattg 


ccacttctca 


ccacgtgact 


ggcagegact 


660 


t atcaacaac 


aactqqggat 


tccggcccaa 


gaagctcaac 


ttcaagctct 


tcaacatcca 


720 


cr cr t c a a cr cr a cr 

\j y w w t#4 y y y 


gtcacgacga 


atgacqgcqt 


cacaaccatc 


gctaataacc 


ttaccagcac 


780 


crattcacrcrtc 

y y w u» w y y w 


ttttcggact 


eggaatatea 


actgccgtac 


gtcctcggct 


ccgcgcacca 


840 


gggct gcctg 


cctccgttcc 


eggcagaegt 


gttcatgatt 


ccgcagtacg 


gatacctgac 


900 


tctgaacaat 


ggcagccaat 


eggtaggecg 

3 17 3 3 3 


ttcctccttc 


tactgectgg 


agtactttcc 


960 


ttctcagatg 


ctgaqaacgg 

3 3 3 3 


gcaacaactt 


cacctttagc 


tacaccttcg 


aggaegtgee 


1020 


tttccacagc 


agctacgcgc 


acagecagag 


tetgggcegg 

3 3 3 3 3 


ctgatgaatc 


ccctcatcga 


1080 


ccaatacctcr 


tactacttgg 


ccagaacaca 


gagcaacgea 


ggaggtactg 


ctggcaatcg 


1140 


rrrfaartcrcaef 

U y uUU y ci y 


ttttat cacrcr 


gcggacctac 


caccatggcc 


gaacaagcaa 


agaactggct 


1200 


gcccggacct 


tgcttccggc 


aacagagagt 


atccaagacg 


ctggatcaaa 


ataacaacag 


1260 


caactttgcc 


tqqactggtg 


ccacaaaata 


ccatttaaat 


ggaagaaatt 


cattggttaa 


1320 


tcccggtgtc 


gccatggcaa 


cccacaagga 


cgacgaggaa 


cgcttcttcc 


ettcgagegg 


1380 


agttctaatt 


tttggcaaaa 


ctggagcagc 


taataaaact 


acattagaaa 


aegtgetcat 


1440 


gacaaatgaa gaagaaattc gtcctaccaa cccggtagct 


accgaggaat 


aegggattgt 

3 3 3 3 


1500 


aagcagcaac 


ttgeaggegg 


ctagcaccgc 


agcccagaca 


caagttgtta 


acaaccaggg 


1560 


agccttacct 


ggcatggtct 


ggcagaaccg 


ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 


caagattcct 


cacacggacg 


gcaactttca 


cccgtctcct 


ctaatgggtg 


gctttggact 


1680 


gaaacacccg 


cctccccaga 


tcctgatcaa 


aaacacaccg 


gtacctgeta 


atcctccaga 


1740 


agtgtttact 


cctgccaagt 


ttgettcett 


catcacgcag 


tacagcaccg 


ggcaagtcag 


1800 


cgttgagatc 


gaatgggagc 


tgcagaaaga 


gaacagcaag 


cgctggaacc 


cagagattca 


i q c r\ 


gtacacctcc 


aactttgaca 


aacagactgg 


agtggacttt 


gctgttgaca 


gccagggtgt 


1920 


ttactctgag 


cct 










1933 



<210> 51 
<211> 1933 
<212> DNA 

<213> new AAV serotype, clone 223.5 
<400> 51 

caaggcctac gaccagcagc teaaageggg tgacaatccg tacctgeggt ataaccacgc 60 
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cgacgccgag 


tttcaggagc 


gtcttcaaga 


agataegtet 


tttgggggca 


acctcgggcg 


12 0 


agcag tc ttc 


caggecaaaa 


agegggttet 

3 3 3 3 


cgaacctctt 


ggtctggttg 


agacgccagc 


180 


taaaacacrca 


cctggaaaga 


agegaceggt 

3 3 33 


agactcgcca 


gactccacct 


egggcategg 


240 


caacraaacrcrc 


cagcagcccg 


cgaaaaagag 


actcaacttt 


gggcagactg 

3 3 3 3 3 


gcgactcaga 


300 




gaccctcaac 


caateggaga 


accaccagca 


ggcccctctg 


gtctgggatc 


360 


tggtacaat g 


actcrcacrgccr 


gtggcgcacc 


aatggctgac 


aataacgagg 


gcgccgacgg 


420 


agtgggtaat 


gectcaggaa 


attggcattg 


cgattccaca 


cggctgggcg 


acagagtcat 


480 


caccaccagc 


acccgaacct 


gggccctgcc 


cacctacaac 


aaccacctct 


acaagcaaat 


540 


c tccagtcag 


tcaacaggga 


gcaccaacga 


taaegtctat 


tteggctaca 


gcaccccctg 


600 


ggggt at ttt 


gacttcaaca 


gattccattg 


ccacttctca 


ccacgtgact 


ggcagegact 


660 


tatcaacaac 


aactggggat 

3 Z3 33 3 


tccggcccaa 


gaagctcaac 


ttcaagctct 


tcaacatcca 


720 


ggtcaaggag 


gtcacgacga 


atgacggcgt 


cacaaccatc 


gctaataacc 


ttaccagcac 


780 


ggttcaggtc 


tttteggact 


eggaatatea 


actgccgtac 


gtcctcggct 


ccgcgcacca 


840 


acrcrctcrcctg 


cctccgttcc 


eggcagaegt 


gttcatgatt 


ccgcagtacg 


gatacctgac 


900 


tctgaacaat 


ggcagccaat 


eggtaggecg 


ttcctccttc 


tactgectgg 


agtactttcc 


960 


ttctcagatg 


ctgagaaegg 


gcaacaactt 


cacctttagc 


tacaccttcg 


aggaegtgee 


1020 


tttccacagc 


agctacgcgc 


acagecagag 


tetgggcegg 


ctgatgaatc 


ccctcatcga 


1080 


ccagtacct g 


tactacttgg 


ccagaacaca 


gagcaacgea 


ggaggtactg 


ctggcaatcg 


1140 


aaaactgcacr 


ttttatcagg 


gcggacctac 


caccatggcc 


gaacaagcaa 


agaactggct 


1200 


gcccggacc t 


tgcttccggc 


aacagagagt 


atccaagacg 


ctggatcaaa 


ataacaacag 


1260 


caactttgcc 


tggactggtg 

3 3 33 3 


ccacaaaata 


ccatttaaat 


ggaagaaatt 


cattggttaa 


1320 


tcccggtgtc 


gccatggcaa 


cccacaagga 


cgacgaggaa 

3 3 3 3 


cgcttcttcc 


ettcgagegg 


1380 


agt tctaatt 


tttggcaaaa 


ctggagcagc 

33 3 3 


taataaaact 


acattagaaa 


aegtgetcat 


1440 


gacaaatgaa 


gaagaaattc 


gtcctaccaa 


cccggtagct 


accgaggaat 


aegggattgt 


1500 


a age age a a c 


ttacaggcag 

^ 3 ww * , 3 3 w 3 3 


ctagcaccgc 


agcccagaca 


caagttgtta 


acaaccaggg 


1560 


agccttacct 


aacatggtct 

3 3 ^3 3 


ggcagaaccg 


ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 


c a a gat t cc t 


cacacggacg 


gcaactttca 


cccgtctcct 


ctaatgggtg 


gctttggact 


1680 


gaaacacccg 


cctccccaga 


tcctgatcaa 


aaacacaccg 


gtacctgeta 


atcctccaga 


1740 


agtgtttact 


cctgccaagt 


ttgettcett 


catcacgcag 


tacagcaccg 


ggcaagtcag 


1800 


cgttgagatc 


gaatgggagc 


tgcagaaaga 


gaacagcaag 


cgctggaacc 


cagagattca 


1 O C A 

1860 


gtacacctcc 


aactttgaca 


aacagactgg 


agtggacttt 


gctgttgaca 


gccagggtgt 


1920 


ttactctgag 


cct 










1933 
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<210> 52 

<211> 1933 

<212> DNA 

<213> new AAV serotype, clone 223.6 

<400> 52 



caaggcctac 


gaccagcagc 


tcaaagcggg tgacaatccg 


tacctgcggt 


ataaccacgc 


60 


cgacgccgag 


tttcaaaacrc 


gtcttcaaga agatacgtct 


tttgggggca 


acctcgggcg 


120 


agcagtc ttc 


caggccaaaa 


agcgggttct cgaacctctt 


ggtctggttg 


agacgccagc 


180 


t aagacggca 


cctggaaaga 


agcgaccggt agactcgcca gactccacct 


cgggcatcgg 


240 


caagaaaggc 


cagcagcccg 


cgaaaaagag actcaacttt 


gggcagactg 


gcgactcaga 


300 


gtcagtcccc 


gaccctcaac 


caatcggaga accaccagca 


ggcccctctg 


gtctgggatc 


360 


tggtacaatg 


gctgcaggcg 


gtggcgcacc aatggctgac 


aatagcgagg 


gcgccgacgg 


420 


aatqggtaat 


gcctcaggaa 


attggcattg cgattccaca 


tggctgggcg 


acagagtcat 


480 


caccaccagc 


acccgaacct 


gggccctgcc cacctacaac 


aaccacctct 


acaagcaaat 


540 


ctccagtcag 


tcagcaggga 


gcaccaacga taacgtctat 


ttcggctaca 


gcaccccctg 


600 


ggggt at t t t 


gacttcaaca 


gattccattg ccacttctca 


ccacgtgact 


ggcagcgact 


660 


tat caacaac 


aactggggat 


tccggcccaa gaagctcaac 


ttcaagctct 


tcaacatcca 


720 


ggtcaaggag 


gtcacgacga 


atgacggtgt cacaaccatc 


gctaataacc 


ttaccagcac 


780 


ggttcaggtc 


ttttcggact 


cggaatatca actgccgtac 


gtcctcggct 


ccgcgcacca 


840 


aacrctcrcctq 


cctccgttcc 


cggcagacgt gttcatgatt 


ccgcagtacg 


gatacctgac 


900 


tctgaacaat 


ggcagccaat 


cggtaggccg ttcctccttc 


tactgcctgg 


agtactttcc 


960 


u ll- uy u u y 


rt cracf aacaff 


gcaacaactt cacctttagc 


tacaccttcg 


aggacgtgcc 


1020 




acrct accrccrc 

u. y w u w y y ^ 


acagccagag tctggaccgg 


ctgatgaatc 


ccctcatcga 


• 1080 


rcaotacctcf 


tactacttgg 


ccagaacaca gagcaacgca 


ggaggtactg 


ctggcaatcg 


1140 


cr era ac t o c a a 

U U U, Li W w W 


ttttatcagg 


gcggacctac caccatggcc 


gaacaagcaa 


agaactggct 


1200 


gcccggacc t 


tgcttccggc 


aacagagagt atccaagacg 


ctggatcaaa 


ataacaacag 


1260 


caactttgcc 


tggactggtg 


ccacaaaata ccatttaaat 


ggaagaaatt 


cattggttaa 


1320 


tcccggtgtc 


gccatggcaa 


cccacaagga cgacgaggaa 


cgcttcttcc 


cttcgagcgg 


1380 


agttctaatt 


tttggcaaaa 


ctggagcagc taataaaact 


acattagaaa 


acgtgctcat 


1 4 4 D 


gacaaatgaa 


gaagaaattc 


gtcctaccaa cccggtagct 


accgaggaat 


acgggattgt 


1500 


aagcagcaac 


ttgcaggcgg 


ctagcaccgc agcccagaca 


caagttgtta 


acaaccaggg 


1560 


agccttacct 


ggcatggtct 


ggcagaaccg ggacgtgtac 


ctgcaaggtc 


ccatttgggc 


1620 


caagattcct cacacggacg gcaactttca cccgtctcct ctaatgggtg gctttggact 


1680 


gaaacacccg 


cctccccaga 


tcctgatcaa aaacacaccg 


gtacctgcta 


atcctccaga 


1740 



93 



WO 03/042397 



PCT/US02/33629 



agtgtttact cctgccaagc ttgcttcctt catcacgcag tacagcaccg ggcaagtcag 1800 

cgttgagatc gagtgggagc tgcagaaaga gaacagcaag cgctggaacc cagagattca 1860 

gtacacctcc aactttgaca aacagactgg agtggacttt gctgttgaca gccagggtgt 1920 

ttactctgag cct 1933 

<210> 53 
<211> 1933 
<212> DNA 

<213> new AAV serotype, clone 223.7 
<400> 53 

caaggcctac gaccagcagc tcaaagcggg tgacaatccg tacctgcggt ataaccacgc 60 

cgacgccgag tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg 120 

agcagtcttc caggccaaaa agcgggttct cgaacctctt ggtctggttg agacgccagc 180 

taagacggca cctggaaaga agcgaccggt agactcgcca gactccacct cgggcatcgg 240 

caagaaaggc cagcagcccg cgaaaaagag actcaacttt gggcagactg gcgactcaga 300 

gtcagtcccc gaccctcaac caatcggaga accaccagca ggcccctctg gtctgggatc 360 

tggtacaatg gctgcaggcg gtggcgcacc aatggctgac aataacgagg gcgccgacgg 420 

agtgggtaat gcctcaggaa attggcattg cgattccaca tggctgggcg acagagtcat 480 

caccaccagc acccgaacct gggccctgcc cacctacaac aaccacctct acaagcaaat 540 

ctccagtcag tcagcaggga gcaccaacga taacgtctat ttcggctaca gcaccccctg 600 

ggggtatttt gacttcaaca gattccattg ccacttctca ccacgtgact ggcagcgact 660 

tatcaacaac aactggggat tccggcccaa gaagctcaac ttcaagctct tcaacatcca 720 

ggtcaaggag gtcacgacga atgacggcgt cacaaccatc gctaataacc ttaccagcac 780 

ggttcaggtc ttttcggacc cggaatatca actgccgtac gtcctcggct ccgcgcacca 840 

gggctgcctg cctccgttcc cggcagacgt gttcatgatt ccgcagtacg gatacctgac 900 

tctgaacaat ggcagccaat cggtaggccg ttcctccttc tactgcctgg agtactttcc 960 

ttctcagatg ctgagaacgg gcaacaactt cacctttagc tacaccttcg aggacgtgcc 1020 

tttccacagc agctacgcgc acagccagag tctggaccgg ctgatgaatc ccctcatcga 1080 

ccagtacctg tactacttgg ccagaacaca gagcaacgca ggaggtactg ctggcaatcg 1140 

ggaactgcag ttttatcagg gcggacctac caccatggcc gaacaagcaa agaactggct 1200 

gcccggacct tgcttccggc aacagagagt atccaagacg ctggatcaaa ataacaacag 1260 

caactttgcc tggactggtg ccacaaaata ccatttaaat ggaagaaatt cattggttaa 1320 

tcccggtgtc gccatggcaa cccacaagga cgacgaggaa cgcttcttcc cttcgagcgg 1380 

agttctaatt tttggcaaaa ctggagcagc taataaaact acattagaaa acgtgctcat 1440 

gacaaatgaa gaagaaattc gtcctaccaa cccggtagct accgaggaat acgggattgt 1500 
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aagcagcaac ttgcaggcgg ctagcaccgc agcccagaca caagttgtta acaaccaggg 1560 

agccttacct ggcatggtct ggcagaaccg ggacgtgtac ctgcaaggtc ccatttgggc 1620 

caagattcct cacacggacg gcaactttca cccgtctcct ctaatgggtg gctttggact 1680 

gaaacacccg cctccccaga tcctgatcaa aaacacaccg gtacctgcta atcctccaga 174 0 

agtgtttact cctgccaaga ttgcttcctt catcacgcag tacagcaccg ggcaagtcag 1800 

cgttgagatc gagtgggagc tgcagaaaga gaacagcaag cgctggaacc cagagattca 18 60 

gtacacctcc aactttgaca aacagactgg agtggacttt gctgttgaca gccagggtgt 1920 

ttactctgag cct 1933 

<210> 54 
<211> 3123 
<212> DNA 

<213> new AAV serotype, clone A3 . 4 
<400> 54 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga aaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggaaagat gaccgccaag gtcgtggaat 120 

ctgccaaagc cattctgggt ggaagcaagg ttcgtgtgga ccagaaatgc aagtcttcgg 180 

cccagatcga cccgactccg gtgattgtca cctctaacac caacatgtgc gccgtgattg 240 

acggaaactc gaccaccttc gagcaccagc agccgttgca agaccggatg ttcaaatttg 300 

aacttacccg ccgtttggat catgactttg ggaaggtcac caagcaggaa gtcaaagact 360 

ttttccggtg ggctcaagat cacgtgactg aggtggagca tgagttctac gtcaaaaagg 420 

gtggagccaa gaaaaggccc gcccccgatg atgtatatat aaatgagccc aagcgggcgc 480 

gcgagtcagt tgcgcagcca tcgacgtcag acgcggaagc ttcgataaac tacgcgggca 540 

ggtaccaaaa caaatgttct cgtcacgtgg gcatgaatct gatgctgttt ccctgtcgac 600 

aatgcgaaag aatgaatcag aattcaaata tctgcttcac acacgggcaa aaagactgtt 660 

tggaatgctt tcccgtgtca gaatctcaac ccgtttctgt cgtcagaaaa acgtatcaga 720 

aactttgtta cattcatcat atcatgggaa aagaaccaga cgcctgcact gcctgcgacc 780 
tggtaaatgt ggacttggat gactgtattt ctgagcaata aatgacttaa atcaggtatg 
gctgctgacg gttatcttcc agattggctc gaggacactc tctctgaagg aatcagacag 
tggtggaagc tcaaacctgg cccaccaccg ccgaaaccta accaacaaca ccgggacgac 

agtaggggtc ttgtgcttcc tgggtacaag tacctcggac ccttcaacgg actcgacaaa 1020 

ggagagccgg tcaacgaggc agacgccgcg gccctcgagc acgacaaagc ctacgaccac 1080 

cagctcaagc aaggggacaa cccgtacctc aaatacaacc acgcggacgc tgaatttcag 1140 

gagcgtcttc aagaagatac gtctttcggg ggcaacctcg ggcgagcagt cttccaggcc 1200 

aaaaagaggg tactcgagcc tcttggtctg gttgaggaag ctgttaagac ggctcctgga 1260 



840 
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aaaaagagac 


ctatagagca 


gtctcctgca 


gaaceggact 


cttcctcggg 


categgegaa 


1320 


tcaggccagc 


agcccgct aa 


gaaaagactc 


aattttggtc 


agactggega 

3 3 3 3 


cacagagtca 


1380 


gtcccagacc 


ctcaaccaat 


cggagaaccc 


cccgcagccc 


cctctggtgt 


gggatctaat 


1440 


acaataactt 


caaaca atcrcr 

w u. y y y h y y 


aacaccaata 


gcagacgata 


aegaaggege 


egaeggagtg 

-3333 3 


1500 


ggt aattcct 


caaaaaatta 

W M M H *•* w w y 


geattgegat 


tccacatgga 


tgagcgacag 

33 3 3 3 


agttatcacc 


1560 


accagcacaa 


gaacctgggc 


cctccccacc 


tacaataatc 


acctctacaa 


gcaaatctcc 


1620 


agcgaatcgg 

3 3 3 3 


gagccaccaa 


cgacaaccac 


tacttegget 


acagcacccc 


ctgggggtat 


1680 


tttgacttta 


acagattcca 


ctgtcacttc 


tcaccacgtg 


actggcagcg 


actcatcaac 


1740 


aacaactggg 


gatttagacc 


caagaaactc 


aatttcaagc 


tcttcaacat 


ccaagtcaag 


1800 


tracrcftcaccrc 

M y sJ \m w t-4. w y 


agaatgat gg 


aaccacgacc 


ategecaata 


acctt accag 


cacggtgcag 


1860 


gtcttcacag 


actc tgagt a 


ccagctgccc 


tacgtcctcg 


gtteggctea 


ccagggctgc 

w w 3 3^3 


1920 


c tt ccgccgt 


tcccagcaga 


cgt cttcatg 


attcctcagt 


aeggctaett 


gactctgaac 


1980 


aatggcagcc 


aageggtagg 


aegttcttea 


ttctactgtc 


tagagtattt 


tccctctcag 


2040 


atgctgagga 


egggaaacaa 


cttcaccttc 


agctacactt 


ttgaagacgt 


gcctttccac 


2100 


agcagctacg 


cgcacagcca 


gagtctggat 


eggctgatga 


atcctctcat 


tgaccagtac 


2160 


ctgtattacc 


tgagcaaaac 


tcagggtaca 


agtggaacaa 


cgcagcaatc 


gagactgeag 


2220 


t tcagccaag 


ctcragcct acr 


ctccatggct 


cagcaggcca 


aaaactggct 


accgggaccc 


2280 


agctaccgac 


ageagegaat 


gtctaagacg 


gctaatgaca 


acaacaacag 


tgaatttget 


2340 


t ggactgcag 


ccaccaaata 


ttacctgaat 


ggaagaaatt 


ctctggtcaa 


tcccgggccc 


2400 


ccaatggcca 


gtcacaagga 


caataaaaaa 
3"^yyy **** 


aagtatttcc 


ccatgcacgg 


aaatctcatc 


2460 


tttggaaaac 


aaaacacacra 

^ J ^ 3 3 


aactaccaat 


gtggacattg 

3 33 3 


aatcagtgct 


tattacagac 


2520 


gaagaagaaa 


tcagaacaac 


taatcctgtg 


gctacagaac 


aataeggaca ggttgccacc 


2580 


aaccatcaga 


gtcaggacac 


cacagcttcc 


tatggaagtg 


tggacageca 


gggaatctta 


2640 


cct ggaatgg 


tgtggcagga 


ccgcgatgtc 


tatcttcaag 


gtcccatttg 


ggecaaaact 


2700 


cct cacacgg 


aeggacac tt 


tcatccttct 


ccgctcatgg 


gaggctttgg 


actgaaacac 


2760 


cctcctcccc 


agatcctgat 


caaaaacaca 


cctgtgccag 


cgaatcccgc 


gaccactttc 


2820 


actcctncraa 


agtttgc ttc 


gttcattacc 


cagtattcca 


ccggacaggt 


cagcgtggaa 


2880 


atacraat crao 

t*. Vfc VH »H w *H *H Vj 


aorctcrcacraa 


agaaaacagc 


aaacgctgga 


acccagaaat 


tcagtacacc 


2940 


tccaactaca 


acaacrtccr crt 


gaatgtggag 


tttaccgtgg 


aegcaaaegg 


tgtttattct 


3000 


gaaccccgcc 


ctattggcac 


tegttacett 


acccggaact 


tgtaatttcc 


tgttaatgaa 


3060 


taaaccgatt 


tatgegttte 


agttgaactt 


tggtctctgc 


gaagggcgaa 


ttcgcggccg 


3120 


eta- 












3123 
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<210> 55 
<211> 3113 
<212> DNA 

<213> new AAV serotype, clone A3 . 5 
<400> 55 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga aaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggaaagat gaccgccaag gtcgtggaat 120 

ctgccaaagc cattctgggt ggaagcaagg ttcgtgtgga ccagaaatgc aagtcttcgg 180 

cccagatcga cccgactccg gtgattgtca cctctaacac caacatgtgc gccgtgattg 240 

acggaaactc gaccaccttc gagcaccagc agccgttgca agaccggatg ttcaaatttg 300 

aacttacccg ccgtttggat catgactttg ggaaggtcac caagcaggaa gtcaaagact 360 

ttttccggtg ggctcaagat cacgtgactg aggtggagca tgagttctac gtcaaaaagg 420 

gtggagccaa gaaaaggccc gcccccgatg atgtatatat aaatgagccc aagcgggcgc 480 

gcgagtcagt tgcgcagcca tcgacgtcag acgcggaagc ttcgataaac tacgcggaca 540 

ggtaccaaaa caaatgttct cgtcacgtgg gcatgaatct gatgctgttt ccctgtcgac 600 

aatgcgaaag aatgaatcag aattcaaata tctgcttcac acacgggcaa aaagactgtt 660 

tggaatgctt tcccgtgtca gaatctcaac ccgttcctgt cgtcagaaaa acgtatcaga 720 

aactttgtta cattcatcat atcatgggaa aagtaccaga cgcctgcact gcctgcgacc 780 

tggtaaatgt ggacttggat gactgtattt ctgagcaata aatgacttaa atcaggtatg 840 

gctgctgacg gttatcttcc agattggctc gaggacactc tctctgaagg aatcagacag 900 

tggtggaagc tcaaacctgg cccaccaccg ccgaaaccta accaacaaca ccgggacgac 960 

agtaggggtc ttgtgcttcc tgggtacaag tacctcggac ccttcaacgg actcgacaaa 1020 

ggagagccgg tcaacgaggc agacgccgcg gccctcgagc acgacaaagc ctacgaccac 1080 

cagctcaagc aaggggacaa cccgtacctc aaatacaacc acgcggacgc tgaatttcag 1140 

gagcgtcttc aagaagatac gtctttcggg ggcaacctcg ggcgagcagt cttccaggcc 1200 

aaaaagaggg tactcgagcc tcttggtctg gttgaggaag ctgttaagac ggctcctgga 1260 

aaaaagagac ctatagagca gtctcctgca gaaccggact cttcctcggg catcggcaaa 1320 

tcaggccagc agcccgctaa gaaaagactc aattttggtc agactggcga cacagagtca 1380 

gtcccagacc ctcaaccaat cggagaaccc cccgcagccc cctctggtgt gggatctaat 1440 

acaatggctt caggcggtgg ggcaccaatg gcagacaata acgaaggcgc cgacggagtg 1500 

ggtaattcct cgggaaattg gcattgcgat tccacatgga tgggcgacag agttatcacc 1560 

accagcacaa gaacctgggc cctccccacc tacaataatc acctctacaa gcaaatctcc 1620 

agcgaatcgg gagccaccaa cgacaaccac tacttcggct acagcacccc ctgggggtat 1680 

tttgacttta acagattcca ctgtcacttc tcaccacgtg actggcagcg actcatcaat 1740 
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aacaactggg gatttagacc caagaaactc aatttcaagc tcttcaacat ccaagtcaag 
gaggtcacgc agaatgatgg aaccacgacc atcgccaata accttaccag cacggtgcag 
gtcttcacag actctgagta ccagctgccc tacgtcctcg gttcggctca ccagggctgc 
cttccgccgt tcccagcaga cgtcttcatg attcctcagt acggctactt gactctgaac 
aatggcagcc aagcggtagg acgttcttca ttctactgtc tagagtattt tccctctcag 
atgctgagga cgggaaacaa cttcaccttc agctacactt ttgaagacgt gcctttccac 
agcagctacg cgcacagcca gagtctggat cggctgatga atcctctcat tgaccagtac 
ctgtattacc tgagcaaaac tcagggtaca agtggaacaa cgcagcaatc gagactgcag 
ttcaaccaag ctgggcctag ctccatggct cagcaggcca aaaactggct accgggaccc 
agctaccgac agcagcgaat gtctaagacg gctaatgaca acaacaacag tgaatttgct 
tggactgcag ccaccaaata ttacccgaat ggaagaaatt ctctggtcaa tcccgggccc 
ccaatggcca gtcacaagga cgatgaggaa aagtatttcc ccatgcacgg aaatctcatc 
tttggaaaac aaggcacagg aactaccaat gtggacattg aatcagtgct tattacagac 
gaagaagaaa tcagaacgac taatcctgtg gctacagaac aatacggaca ggttgccacc 
aaccgtcaga gtcagaacac cacagcttcc tatggaagtg tggacagcca gggaatctta 
cctggaatgg tgtggcagga ccgcgatgtc tatcttcaag gtcccatttg ggccaaaact 
cctcacacgg acggacactt tcatccttct ccgctcatgg gaggctttgg actgaaacac 
cctcctcccc agatcctgat caaaaacaca cctgtgccag cgaatcccgc gaccactttc 
actcctggaa agtttgcttc gttcattacc cagtattcca ccggacaggt cagcgtggaa 
atagagtggg agctgcagaa agaaaacagc aaacgctgga acccggaaat tcagtacacc 
tccaactaca acaagtcggt gaatgtggag tttaccgtgg acgcaaacgg tgtttattct 
gaaccccgcc ctattggcac tcgttacctt acccggaact tgtaatttcc tgttaatgaa 
taaaccgatt tatgcgtttc agttgaactt tggtctctgc gaagggcgaa ttc 

<210> 56 
<211> 3122 
<212> DNA 

<213> new AAV serotype, clone A3. 7 
<400> 56 

agcggccgcg aattcgccct ttctacggct gcgtcaactg gaccaatgaa aactttccct 
tcaacgattg cgtcgacaag atggtgatct ggtgggagga gggaaagatg accgccaagg 
tcgtggaatc tgccaaagcc attctgggtg gaagcaaggt tcgtgtggac cagaaatgca 
ggtcttcggc ccagatcgac ccgactccgg tgattgtcac ctctaacacc aacatgtgcg 
ccgtgattga cggaaactcg accaccttcg agcaccagca gccgttgcaa gaccggatgt 
tcaaatttga acttacccgc cgtttggatc atgactttgg gaaggtcacc aagcaggaag 
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tcaaagactt tttccggtgg gctcaagatc acgtgactga ggtggagcat gagttctacg 420 

tcaaaaaggg tggagccaag aaaaggcccg cccccgatga tgtatatata aatgagccca 480 

agcgggcgcg cgagtcagtt gcgcagccat cgacgtcaga cgcggaagct tcgataaact 540 

acgcggacag gtaccaaaac aaatgttctc gtcacgtggg catgaatctg atgctgtttc 600 

cctgtcgaca atgcgaaaga atgaatcaga attcaaatat ctgcttcaca cacgggcaaa 660 

aagactgttt ggaatgcttt cccgtgtcag aatctcaacc cgtttctgtc gtcagaaaaa 72 0 

cgtatcagaa actttgttac attcatcata tcatgggaaa agtaccagac gcctgcactg 780 

cctgcgacct ggtaaatgtg gacttggatg actgtatttc tgagcaataa atgacttaaa 840 

tcaggtatgg ctgctgacgg ttatcttcca gattggctcg aggacactct ctctgaagga 900 

atcagacagt ggtggaagct caaacctggc ccaccaccgc cgaaacctaa ccaacaacac 960 

cgggacgaca gtaggggtct tgtgcttcct gggtacaagt acctcggacc cttcaacgga 102 0 

ctcgacaaag gagagccggt caacgaggca gacgccgcgg ccctcgagca cgacaaagcc 1080 

tacgaccacc agctcaagca aggggacaac ccgtacctca aatacaacca cgcggacgct 1140 

gaatttcagg agcgtcttca agaagatacg tctttcgggg gcaacctcgg gcgagcagtc 1200 

ttccaggcca aaaagagggt actcgagcct cttggtctgg ttgaggaagc tgttaagacg 1260 

gctcctggaa aaaagagacc tatagagcag tctcctgcag aaccggactc ttcctcgggc 1320 

atcggcaaat caggccagca gcccgctaag aaaagactca attttggtca gactggcgac 1380 

acagagtcag tcccagaccc tcaaccaatc ggagaacccc ccgcagcccc ctctggtgtg 1440 

ggatctaata caatggcttc aggcggtggg gcaccaatgg cagacaataa cgaaggcgcc 1500 

gacggagtgg gtaattcctc gggaaattgg cattgcgatt ccacatggat gggcgacaga 1560 

gttatcacca ccagcacaag aacctgggcc ctccccacct acaataatcg cctctacaag 1620 

caaatctcca gcgaatcggg agccaccaac gacaaccact acttcggcta cagcaccccc 1680 

tgggggtatt ttgactttaa cagattccac tgtcacttct caccacgtga ctggcagcga 1740 

ctcatcaaca acaactgggg atttagaccc aagaaactca atttcaagct cttcaacatc 1800 

caagtcaagg aggtcacgca gaatgatgga accacgacca tcgccaataa ccttaccagc 18 60 

acggtgcagg tcttcacaga ctctgagtac cagctgccct acgtcctcgg ttcggctcac 1920 

cagggctgcc ttccgccgtt cccagcagac gtcttcatga ttcctcagta cggctacttg 1980 

actctgaaca atggcagcca agcggtagga cgttcttcat tctactgtct agagtatttt 2040 

ccctctcaga tgctgaggac gggaaacaac ttcaccttca gctacacttt tgaagacgtg 2100 

cctttccaca gcagctacgc gcacagccag agtctggatc ggctgatgaa tcctctcatt 2160 

gaccagtacc tgtattacct gagcaaaact cagggtacaa gtggaacaac gcagcaatcg 2220 

agactgcagt tcagccaagc tgggcctagc tccatggctc agcaggccaa aaactggcta 2280 
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ccgggaccca gctaccgaca gcagcgaatg tctaagacgg ctaatgacaa caacaacagt 2340 

gaatttgctt ggactgcagc caccaaatat tacctgaatg gaagaaattc tctggtcaat 2400 

cccgggcccc caatggccag tcacaaggac gatgaggaaa agtatttccc catgcacgga 24 60 

aatctcatct ttggaaaaca aggcacagga actaccaatg tggacattga atcagtgctt 2520 

attacagacg aagaagaaat cagaacaact aatcctgtgg ctacagaaca atacggacag 2580 

gttgccacca accatcagag tcagaacacc acagcttcct atggaagtgt ggacagccag 2640 

ggaatcttac ctggaatggt gtggcaggac cgcgatgtct atcttcaagg tcccatttgg 27 00 

gccaaaactc ctcacacgga cggacacttt catccttctc cgctcatggg aggctttgga 27 60 

ctgaaacacc ctcctcccca gatcctgatc aaaaacacac ctgtgccagc gaatcccgcg 2820 

accactttca ctcctggaaa gtttgcttcg ttcattaccc agtattccac cggacaggtc 2880 

agcgtggaaa tagagtggga gctgcagaaa gaaaacagca aacgctggaa cccagaaatt 2940 

cagtacacct ccaactacaa caagtcggtg aatgtggagt ttaccgtgga cgcaaacggt 3000 

gtttattctg aaccccgccc tattggcact cgttacctta cccggaactt gtaatttcct 3060 

gttaatgaat aaaccgattt atgcgtttca gttgaacttt ggtctctgcg aagggcgaat 3120 

tc 3122 

<210> 57 
<211> 3123 
<212> DNA 

<213> new AAV serotype, clone A3 . 3 
<400> 57 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga aaactttccc ttcaacgatt 60 

gcgtcgacaa gatggtgatc tggtgggagg agggaaagat gaccgccaag gtcgtggaat 120 

ctgccaaagc cattctgggt ggaggcaagg ttcgtgtgga ccagaaatgc aagtcttcgg 180 

cccagatcga cccgactccg gtgattgtca cctctaacac caacatgtgc gccgtgattg 240 

acggaaactc gaccaccttc gagcaccagc agccgttgca agaccggatg ttcaaatttg 300 

aacttacccg ccgtttggat catgactttg ggaaggtcac caagcaggaa gtcaaagact 360 

ttttccggtg ggctcaagat cacgtgactg aggtggagca tgagttctac gtcaaaaagg 420 

gtggagccaa gaaaaggccc gcccccgatg atgtatatat aaatgagccc aagcgggcgc 480 

gcgagtcagt tgcgcagcca tcgacgtcag acgcggaagc ttcgataaac tacgcggaca 540 

ggtaccaaaa caaatgttct cgtcacgtgg gcatgaatct gatgctgttt ccctgtcgac 600 

aatgcgaaag aatgaatcag aattcaaata tctgcttcac acacgggcaa aaagactgtt 6 60 

tggaatgctt tcccgtgtca gaatctcaac ccgtttctgt cgtcagaaaa acgtatcaga 720 
aactttgtta cattcatcat atcatgggaa aagtaccaga cgcctgcact gcctgcgacc 
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tggtaaatgt ggacttggat gactgtattt ctgagcaata aatgacttaa atcaggtatg 840 
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gctgctgacg 


gttatcttcc 


agattggctc 


gaggacactc 


tctctgaagg 


aatcagacag 


900 


t aatcrgaag c 


tcaaacctgg 


cccaccaccg 


ccgaaaccta 


accaacaaca 


ccgggacgac 

3 3 3 3 


960 


aatacrgcrcrtc 


ttgtgcttcc 


tgggtacaag 


tacctcggac 


ccttcaacgg 


actcgacaaa 


1020 


a a a cf a a c c cr cr 


tcaacgaggc 


agacgccgcg 


gccctcgagc 


acgacaaagc 


ctacgaccac 


1080 


cagctcaagc 


aaggggacaa 


cccgtacctc 


aaatacaacc 


acgcggacgc 


tgaatttcag 


1140 


gagcgtcttc 


aagaagatac 


gtctttcggg 


ggcaacctcg 


ggcgagcagt 


cttccaggcc 


1200 


aaaaagaggg 


tactcgagcc 


tcttggtctg 


gttgaggaag 

3 3 3 3 3 


ctgttaagac 


ggctcctgga 

3 3 3 3 


1260 


aaaaagagac 


ctatagagca 


gtctcctgca 


gaaccggact 


cttcctcggg 


catcggcaaa 


1320 


t caggccagc 


agcccgctaa 


gaaaagactc 


aattttggtc 


agactggcga 


cacagagtca 


1380 


gtcccaggcc 


ctcaaccaat 


cggagaaccc 


cccgcagccc 


cctctggtgt 


gggatctaat 


1440 


acaatggctt 


caggcggtgg 


ggcaccaatg 


gcagacaata 


acgaaggcgc 

3 3 3 3 


cgacggagtg 

3 3 3 3 3 


1500 


ggtaattcct 


cgggaaattg 


gcattgcgat 


tccacatgga 


tgggcgacag 

Z7 Z? Zf Z* ZJ 


agttatcacc 


1560 


accagcacaa 


gaacctgggc 


cctccccacc 


tacaataatc 


acctctacaa 


gcaaatctcc 


1620 


agcgaatcgg 


gagccaccaa 


cgacaaccac 


tacttcggct 


acagcacccc 


ctgggggtat 


1680 


tttgacttta 


acagattcca 


ctgtcacttc 


tcaccacgtg 


actggcagcg 

3 3 3 3 


actcatcaac 


1740 


aacaactggg 


gatttagacc 


caagaaactc 


aatttcaagc 


tcttcaacat 


ccaagtcaag 


1800 


gaggtcacgc 


agaatgatgg 


aaccacgacc 


atcgccaata 


accttaccag 


cgcggtgcag 


1860 


gtcttcacag 


actctgagta 

3 3 


ccagctgccc 


tacgtcctcg 


gttcggctca 


ccagggctgc 


1920 


cttccgccgt 


tcccagcaga 


cgtcttcatg 


attcctcagt 


acggctactt 


gactctgaac 


1980 


a. at ggcagcc 


aaacaataacr 


acgttcttca 


ttctactgtc 


t agagtattt 


tccctctcag 


2040 


atgctgagga 


cgggaaacaa 


cttcaccttc 


agctacac tt 


ttgaagacgt 


gcctttccac 


2100 


agcagctacg 


cgcacagcca 


gagtctggat 

3 3 3 3 


cggctgatga 


atcctctcat 


tgaccagtac 


2160 


ctgtattacc 


tgagcaaaac 


tcagggtaca 


agtggaacaa 


cgcagcaatc 


gagactgcag 


2220 


ttcagccaag 


ctgggcctag 


ctccatggct 


cagcaggcca 


aaaactggct 


accgggaccc 


2280 


agctaccgac 


agcagcgaat 


gtctaagacg 


gctaatgaca 


acaacaacag 


tgaatttgct 


2340 


tgcractgcag 


ccaccaaata 


ttacctgaat 


ggaagaaatt 


ctctggtcaa 


tcccgggccc 


2400 


ccagtggcca 


gtcacaagga 


cgatgaggaa 

3 *** 3 3 3 


aagtatttcc 


ccatgcacgg 


aaatctcatc 


2460 


tttggaaaac 


aaggcacagg 


aactaccaat 


gtggacattg 

3 7 3 3 


aatcagtgct 


tattacagac 


2520 


gaagaagaaa 


tcagaacaac 


taatcctgtg 


gctacagaac 


aatacggaca 


ggttgccacc 


2580 


aaccatcaga 


gtcagaacac 


cacagcttcc 


tatggaagtg 


tggacagcca 


gggaatctta 


2 640 


cctggaatgg 


tgtggcagga 


ccgcgatgtc 


tatcttcaag 


gtcccatttg 


ggccaaaact 


2700 


cctcacacgg 


acggacactt 


tcatccttct 


ccgctcatgg 


gaggctttgg 


actgaaacac 


2760 



101 



WO 03/042397 



PCT/US02/33629 



cctcctcccc 
actcctggaa 
atagagtggg 
tccaactaca 
gaaccccgcc 
taagccgatt 
cct 



agatcctgat caaaaacaca cctgtgccag cgaatcccgc gaccactttc 
agtttgcttc gttcattacc cagtattcca cctgacaggt cagcgtggaa 
agctgcagaa agaaaacagc aaacgctgga acccagaaat tcagtacacc 
acaagtcggt gaatgtggag tttaccgtgg acgcaaacgg tgtttattct 
ctattggcac tcgttacctt acccggaact tgtaatttcc tgttaatgaa 
tatgcgtttc agttgaactt tggtctctgc gaagggcgaa ttcgtttaaa 



2820 
2880 
2940 
3000 
3060 
3120 
3123 



<210> 58 
<211> 2969 
<212> DNA 

<213> new AAV serotype, clone 42.12 
<400> 58 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 
gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 
ccgccaaggc cattctcggc ggcagcaagg tgcgcgtgga ccaaaagtgc aagtcgtccg 
cccagatcga ccccblccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 
acgggaacag caccaccttc gagcaccagc agccgttaca agaccggatg ttcaaatttg 
aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcaaagagt 
tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca tgagttctac gtcagaaagg 
gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 
gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 
acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 
agacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 
gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaag aggacgtatc 
ggaaactctg tgccattcat catctgctgg ggcgggctcc cgagattgct tgctcggcct 
gcgatctggt caacgtggac ctggatgact gtgtttctga gcaataaatg acttaaacca 
ggtatggctg ccgatggtta tcttccagat tggctcgagg acaacctctc tgagggcatc 
cgcgagtggt gggacttgaa acctggagcc ccgaaaccca aagccaacca gcaaaagcag 
gacgacggcc ggggtctggt gcttcctggc tacaagtacc tcggaccctt caacggactc 
gacaagggag agccggtcaa cgaggcagac gccgcggccc tcgagcacga caaggcctac 
gacaagcagc tcgagcaggg ggacaacccg tacctcaagt acaaccacgc cgacgccgag 
tttcaggagc gtcttcaaga agatacgtct tttgggggca acctcgggcg agcagtcttc 
caggccaaga agcgggttct cgaacctctc ggtctggttg aggaaggcgc taagacggct 
cctggaaaga agagaccggt agagccatca ccccagcgtt ctccagactc ctctacgggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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atcggcaaga 


caggccagca 


gcccgcgaaa 


aagagactca 


actttgggca 


gactggcgac 


1380 


tcagagtcag 


tgcccgaccc 


tcaaccaatc 


ggagaacccc 


ccgcaggccc 


ctctggtctg 


1440 


ggatctggta 


caatggctgc 


aggcggtggc 


gctccaatgg 


cagacaataa 


cgaaggcgcc 


1500 


gacqgagtgg 

17 ^7 -5 " 3 3 


gtagttcctc 


aggaaattgg 


cattgcgatt 


ccacatggct 


gggcgacaga 


1560 


gtcatcacca 


ccagcacccg 


aacctgggcc 


ctccccacct 


acaacaacca 


cctctacaag 


1620 


caaatctcca 


acgggacatc 


gggaggaagc 


accaacgaca 


acacctactt 


cggctacagc 


1680 


accccctggg 


ggtattttga 


ctttaacaga ttccactgcc 


acttctcacc 


acgtgactgg 


1740 


cagcgactca 


tcaacaacaa 


ctggggattc 


cggcccaaga 


gactcaactt 


caagctcttc 


1800 


aacatccagg 


tcaaggaggt 


cacgcagaat 


gaaggcacca 


agaccatcgc 


caataacctt 


1860 


accagcacga 


ttcaggtctt 


tacggactcg 


gaataccagc 


tcccgtacgt 


cctcggctct 


1920 


gcgcaccagg 


gctgcctgcc 


tccgttcccg 


gcggacgtct 


tcatgattcc 


tcagtacggg 


1980 


tacctgactc 


tgaacaacgg 


cagtcaggcc 


gtgggccgtt 


cctccttcta 


ctgcctggag 


2040 


tactttcctt 


ctcaaatgct 


gagaacgggc 


aacaactttg 


agttcagcta 


ccagtttgag 


2100 


gacgtgcctt 


ttcacagcag 


ctacgcgcac 


agccaaagcc 


tggaccggct 


gacgaacccc 


2160 


ctcatcgacc 


agtacctgta 


ctacctggcc 


cggacccaga 


gcactacggg 


gtccacaagg 


2220 


gggctgcagt 


tccatcaggc 


tgggcccaac 


accatggccg 


agcaatcaaa 


gaactggctg 


2280 


cccggaccct 


gttatcggca 


gcagagactg 


tcaaaaaaca 


tagacagcaa 


caacaacagt 


2340 


aactttgcct 


ggaccggggc 


cactaaatac 


catctgaatg 


gtagaaattc 


attaaccaac 


2400 


ccgggcgtag 


ccatggccac 


caacaaggac 


gacgaggacc 


agttctttcc 


catcaacgga 


2460 


gtgctggttt 


ttggcaaaac 


gggggctgcc 


aacaagacaa 


cgctggaaaa 


cgtgctaatg 


2520 


accagcgagg 

3 3 3 3 


aggagatcaa 


aaccaccaat 


cccgtggcta 


cagaagaata 


cggtgtggtc 


2580 


tccagcaacc 


tgcaatcgtc 


tacggccgga 


ccccagacac 


agactgtcaa cagccagggg 


2640 


gctctgcccg 


gcatggtctg 

3 w 3 3 3 


gcagaaccgg 


gacgtgtacc 


tgcagggtcc 


catctgggcc 


2700 


aaaattcctc 


acacggacgg 


caactttcac 


ccgtctcccc 


tgatgggcgg 


atttggactc 


2760 


aaacacccgc 


ctcctcaaat 


tctcatcaag 


tatacttcca 


actactacaa 


atctacaaat 


2820 


guggacut-i-y 


(_ L. y LtaaLaL 


tgagggtact 


tattcagagc 


ctcgccccat 


tggcacccgt 


2880 


tacctcaccc 


gtaacctgta 


attgcctgtt 


aatcaataaa 


ccggttaatt 


cgtttcagtt 


2940 


gaactttggt 


ctctgcgaag 


ggcgaattc 








2969 



<210> 59 
<211> 3129 
<212> DNA 

<213> new AAV serotype, clone 44.2 
<400> 59 

gaattcgccc tttctacggc tgcgtcaact ggaccaatga gaactttccc ttcaacgatt 
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gcgtcgacaa gatggtgatc tggtgggagg agggcaagat gacggccaag gtcgtggagt 120 

ccgccaaggc cattctcggc ggcagcaaag tgcgcgtgga ccaaaagtgc aagtcgtccg 180 

cccagatcga ccccaccccc gtgatcgtca cctccaacac caacatgtgc gccgtgattg 240 

acgggaacag caccaccttc gagcaccagc agccgttgca ggaccggatg ttcaagtttg 300 

aactcacccg ccgtctggag cacgactttg gcaaggtgac aaagcaggaa gtcagagagt 360 

tcttccgctg ggcgcaggat cacgtgaccg aggtggcgca cgagttctac gtcagaaagg 420 

gtggagccaa caagagaccc gcccccgatg acgcggataa aagcgagccc aagcgggcct 480 

gcccctcagt cgcggatcca tcgacgtcag acgcggaagg agctccggtg gactttgccg 540 

acaggtacca aaacaaatgt tctcgtcacg cgggcatgct tcagatgctg tttccctgca 600 

aaacatgcga gagaatgaat cagaatttca acatttgctt cacgcacggg accagagact 660 

gttcagaatg tttccccggc gtgtcagaat ctcaaccggt cgtcagaaaa aagacgtatc 720 

ggaaactctg tgcgattcat catctgctgg gggcgggcac ccgagattgc ttgctcggcc 780 

tgcgatctgg tcaacgtgga cctagatgac tgtgtttctg agcaataaat gacttaaacc 840 

aggtatggct gccgatggtt atcttccaga ttggctcgag gacaacctct ctgagggcat 900 

tcgcgagtgg tgggacttga aacctggagc cccgaaaccc aaagccaacc agcaaaagca 960 

ggacgacggc cggggtctgg tgcttcctgg ctacaagtac ctcggaccct tcaacggact 1020 

cgacaagggg gagcccgtca acgcggcgga cgcagcggcc ctcgagcacg acaaggccta 1080 

cgaccagcag ctcaaagcgg gtgacaatcc gtacctgcgg tataaccacg ccgacgccga 1140 

gtttcaggag cgtctgcaag aagatacgtc ttttgggggc aacctcgggc gagcagtctt 1200 

ccaggccaag aagcgggttc tcgaacctct cggtctggtt gaggaaggcg ctaagacggc 1260 

tcctggaaag aagagaccgg tagagccatc accccagcgt tctccagact cctctacggg 1320 

catcggcaag aaaggccagc agcccgcgaa aaagagactc aactttgggc agactggcga 1380 

ctcagagtca gtgcccgacc ctcaaccaat cggagaaccc cccgcaggcc cctctggtct 1440 

gggatctggt acaatggctg caggcggtgg cgctccaatg gcagacaata acgaaggcgc 1500 

cgacggagtg ggtagttcct caggaaattg gcattgcgat tccacatggc tgggcgacag 1560 

agtcatcacc accagcaccc gaacctgggc cctccccacc tacaacaacc acctctacaa 1620 

gcaaatctcc aacgggactt cgggaggaag caccaacgac aacacctact tcggctacag 1680 

caccccctgg gggtattttg actttaacag attccactgc cacttctcac cacgtgactg 1740 

gcagcgactc atcaacaaca actggggatt ccggcccaag agactcaact tcaagctctt 1800 

caacatccag gtcaaggagg tcacgcagaa tgaaggcacc aagaccatcg ccaataacct 1860 

taccagcacg attcaggtct ttacggactc ggaataccag ctcccgtacg tcctcggctc 1920 

tgcgcaccag ggctgcctgc ctccgttccc ggcggacgtc ttcatgattc ctcagtacgg 1980 
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gtacctgact ctgaacaatg gcagtcaggc cgtgggccgt tcctccttct actgcctgga 2040 

gtactttcct tctcaaatgc tgagaacggg caacaacttt gagttcagct accagtttga 2100 

ggacgtgcct tttcacagca gctacgcgca cagccaaagc ctggaccggc tgatgaaccc 2160 

cctcatcgac cagtacctgt actacctgtc tcggactcag tccacgggag gtaccgcagg 2220 

aactcagcag ttgctatttt ctcaggccgg gcctaataac atgtcggctc aggccaaaaa 2280 

ctggctaccc gggccctgct accggcagca acgcgtctcc acgacactgt cgcaaaataa 2340 

caacagcaac tttgcctgga ccggtgccac caagtatcat ctgaatggca gagactctct 2400 

ggtaaatccc ggtgtcgcta tggcaaccca caaggacgac gaagagcgat tttttccgtc 2460 

cagcggagtc ttaatgtttg ggaaacaggg agctggaaaa gacaacgtgg actatagcag 2520 

cgttatgcta accagtgagg aagaaattaa aaccaccaac ccagtggcca cagaacagta 2580 

cggcgtggtg gccgataacc tgcaacagca aaacgccgct cctattgtag gggccgtcaa 2640 

cagtcaagga gccttacctg gcatggtctg gcagaaccgg gacgtgtacc tgcagggtcc 2700 

tatctgggcc aagattcctc acacggacgg aaactttcat ccctcgccgc tgatgggagg 2760 

ctttggactg aaacacccgc ctcctcagat cctgattaag aatacacctg ttcccgcgga 2820 

tcctccaact accttcagtc aagctaagct ggcgtcgttc atcacgcagt acagcaccgg 2880 

acaggtcagc gtggaaattg aatgggagct gcagaaagaa aacagcaaac gctggaaccc 2940 

agagattcaa tacacttcca actactacaa atctacaaat gtggactttg ctgttaacac 3000 

agatggcact tattctgagc ctcgccccat cggcacccgt tacctcaccc gtaatctgta 3060 

attgcttgtt aatcaataaa ccggttgatt cgtttcagtt gaactttggt ctctgcgaag 3120 

ggcgaattc 3129 

<210> 60 
<211> 733 
<212> PRT 

<213> capsid protein of AAV serotype, clone C1VP1 



Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 
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Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Leu Glu Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly He Gly Lys 
145 150 155 160 



Lys Gly Lys Gin Pro Ala Lys Lys Arg Leu Asn Phe Glu Glu Asp Thr 
165 170 175 



Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Asp Thr Ser Ala Met Ser 
180 ~ 185 190 



Ser Asp He Glu Met Arg Ala Ala Pro Gly Gly Asn Ala Val Asp Ala 
195 200 205 



Gly Gin Gly Ser Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 



Asp Ser Thr Trp Ser Glu Gly Lys Val Thr Thr Thr Ser Thr Arg Thr 
225 230 235 240 



Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu Gly Thr 
245 250 255 



Thr Ser Asn Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 



Arg Leu He Asn Asn Asn Trp Gly Leu Arg Pro Lys Ala Met Arg Val 
290 295 300 



Lys He Phe Asn He Gin Val Lys Glu Val Thr Thr Ser Asn Gly Glu 
305 310 315 320 
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Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin lie Phe Ala Asp 
325 330 335 



Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly Ser 
340 345 350 



Leu Ser Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 

Cys Gly lie Val Thr Gly Glu Asn Gin Asn Gin Thr Asp Arg Asn Ala 

370 375 380 

Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn 
385 "* 390 395 400 



Asn Phe Glu Met Ala Tyr Asn Phe Gly Lys Val Pro Phe His Ser Met 
405 410 415 



Tyr Ala Tyr Ser Gin Ser Pro Asp Arg Leu Met Asn Pro Leu Leu Asp 
420 425 430 



Gin Tyr Leu Trp His Leu Gin Ser Thr Thr Ser Gly Glu Thr Leu Asn 
435 440 445 



Gin Gly Asn Ala Ala Thr Thr Phe Gly Lys He Arg Ser Gly Asp Phe 
450 455 460 

Ala Phe Tyr Arg Lys Asn Trp Leu Pro Gly Pro Cys Val Lys Gin Gin 
465 470 475 480 

Arg Leu Ser Lys Thr Ala Ser Gin Asn Tyr Lys He Pro Ala Ser Gly 
485 490 495 



Gly Asn Ala Leu Leu Lys Tyr Asp Thr His Tyr Thr Leu Asn Asn Arg 
500 505 510 



Trp Ser Asn He Ala Pro Gly Pro Pro Met Ala Thr Ala Gly Pro Ser 
515 520 525 

Asp Gly Asp Phe Ser Asn Ala Gin Leu He Phe Pro Gly Pro Ser Val 
530 535 540 

Thr Gly Asn Thr Thr Thr Ser Ala Asn Asn Leu Leu Phe Thr Ser Glu 
545 550 555 560 

Glu Glu He Ala Ala Thr Asn Pro Arg Asp Thr Asp Met Phe Gly Gin 
565 570 575 
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lie Ala Asp Asn Asn Gin Asn Ala Thr Thr Ala Pro lie Thr Gly Asn 
580 585 590 



Val Thr Ala Met Gly Val Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 



lie Tyr Tyr Gin Gly Pro He Trp Ala Lys He Pro His Ala Asp Gly 
610 615 620 



His Phe His Pro Ser Pro Leu He Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Phe He Lys Asn Thr Pro Val Pro Ala Asn Pro Ala 
645 650 . 655 



Thr Thr Phe Thr Ala Ala Arg Val Asp Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ala Val Gin He Glu Trp Glu He Glu Lys Glu Arg 
675 680 685 



Ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn Tyr Gly Asn 
690 695 700 



Gin Ser Ser Met Leu Trp Ala Pro Asp Thr Thr Gly Lys Tyr Thr Glu 
705 710 715 720 

I 

Pro Arg Val He Gly Ser Arg Tyr Leu Thr Asn His Leu 
725 730 



<210> 61 
<211> 733 
<212> PRT 

<213> capsid protein of AAV serotype, clone C2VP1 
<400> 61 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Leu 
2 0 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe His Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 
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Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 ~ 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Leu Glu Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly Lys 
145 150 155 160 * 

Lys Gly Lys Gin Pro Ala Lys Lys Arg Leu Asn Phe Glu Glu Asp Thr 
165 170 175 

Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Asp. Thr Ser Ala Met Ser 
180 185 190 

Ser Asp lie Glu Met Arg Ala Ala Pro Gly Gly Asn Ala Val Asp Ala 
195 200 205 

Gly Gin Gly Ser Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 

Asp Ser Thr Trp Ser Glu Gly Lys Val Thr Thr Thr Ser Thr Arg Thr 
225 230 235 240 

Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu Gly Thr 
245 250 255 



Thr Ser Asn Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 

Arg Leu He Asn Asn Asn Trp Gly Leu Arg Pro Lys Ala Met Arg Val 
290 295 300 

Lys He Phe Asn He Gin Val Lys Glu Val Thr Thr Ser Asn Gly Glu 
305 310 315 320 

Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin He Phe Ala Asp 
325 330 335 
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Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly Ser 
340 345 350 



Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 



Cys Gly lie Val Thr Gly Glu Asn Gin Asn Gin Thr Asp Arg Asn Ala 
370 375 380 



Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg. Thr Gly Asn 
385 390 395 400 



Asn Phe Glu Met Ala Tyr Asn Phe Glu Lys Val Pro Phe His Ser- Met 
405 410 415 



Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro Leu Leu Asp 
420 425 430 



Gin Tyr Leu Trp His Leu Gin Ser Thr Thr Ser Gly Glu Thr Leu Asn 
435 440 445 



Gin Gly Asn Ala Ala Thr Thr Phe Gly Lys lie Arg Ser Gly Asp Phe 
450 455 460 



Ala Phe Tyr Arg Lys Asn Trp Leu Pro Gly Pro Cys Val Lys Gin Gin 
465 470 475 480 



Arg Phe Ser Lys Thr Ala Ser Gin Asn Tyr Lys lie Pro Ala Ser Gly 
485 490 495 



Gly Asn Ala Leu Leu Lys Tyr Asp Thr His Tyr Thr Leu Asn Asn Arg 
500 505 510 



Trp Ser Asn He Ala Pro Gly Pro Pro Met Ala Thr Ala Gly Pro Ser 
515 520 525 



Asp Gly Asp Phe Ser Asn Ala Gin Leu He Phe Pro Gly Pro Ser Val 
530 ~ 535 540 



Thr Gly Asn Thr Thr Thr Ser Ala Asn Asn Leu Leu Phe Thr Ser Glu 
545 550 555 560 



Gly Glu He Ala Ala Thr Asn Pro Arg Asp Thr Asp Met Phe Gly Gin 
565 570 575 



He Ala Asp Asn Asn Gin Asn Ala Thr Thr Ala Pro He Thr Gly Asn 
580 585 590 
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Val Thr Ala Met Gly Val Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 

He Tyr Tyr Gin Gly Pro He Trp Ala Lys He Pro His Ala Asp Gly 
610 615 620 

His Phe His Pro Ser Pro Leu He Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 

Pro Pro Gin He Phe He Lys Asn Thr Pro Val Pro Ala Asn Pro Ala 
645 650 655 

Thr Thr Phe Thr Ala Ala Arg Val Asp Ser Phe He Thr Gin Tyr Ser 
660 665 670 

Thr Gly Gin Val Ala Val Gin He Glu Trp Glu He Glu Lys Glu Arg 
675 680 685 

Ser Lys Arg Arg Asn Pro Glu Val Gin Phe Thr Ser Asn Tyr Gly Asn 
690 695 700 

Gin Ser Ser Met Leu Trp Ala Pro Asp Thr Thr Gly Lys Tyr Thr Glu 
705 710 715 720 

Pro Arg Val He Gly Ser Arg Tyr Leu Thr Asn His Leu 
725 730 



<210> 62 

<211> 733 

<212> PRT 

<213> capsid protein of AAV serotype, clone C5VP162 

<400> 62 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1.5 10 I 5 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Glu Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 ' 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 
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Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 



Asn leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Leu Glu Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly Lys 
145 150 155 160 



Lys Gly Lys Gin Pro Ala Lys Lys Arg Leu Asn Phe Glu Glu Asp Thr 
165 170 175 



Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Asp Thr Ser Ala Met Ser 
180 " 185 190 



Ser Asp lie Glu Met Arg Ala Ala Pro Gly Gly Asn Ala Val Asp Ala 
195 200 205 



Gly Gin Gly Ser Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 



Asp Ser Thr Trp Ser Glu Gly Lys Val Thr Thr Thr Ser Thr Arg Thr 
225 230 235 240 



Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu Gly Thr 
245 250 255 



Thr Ser Asn Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 



Arg Leu lie Asn Asn Asn Trp Gly Leu Arg Pro Lys Ala Met Arg Val 
290 295 300 



Lys lie Phe Asn He Gin Val Lys Glu Val Thr Thr Ser Asn Gly Glu 
305 310 315 320 



Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin He Phe Ala Asp 
325 330 335 



Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly Ser 
340 345 350 
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Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 



Cys Gly He Val Thr Gly Glu Asn Gin Asn Gin Thr Asp Arg Asn Ala 
370 375 380 



Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn 
385 390 395 400 



Asn Phe Glu Thr Ala Tyr Asn Phe Glu Lys Val Pro Phe His Ser Met 
405 410 415 



Tyr Ala His Ser Gin Ser Leu Asp Gly Leu Met Asn Pro Leu Leu Asp 
420 425 430 



Gin Tyr Leu Trp His Leu Gin Ser Thr Thr Ser Gly Glu Thr Leu Asn 
435 440 445 

Gin Gly Asn Ala Ala Thr Thr Phe Gly Lys He Arg Ser Gly Asp Phe 
450 455 460 

Ala Phe Tyr Arg Lys Asn Trp Leu Pro Gly Pro Cys Val Lys Gin Gin 
465 "* 470 475 480 

Arg Phe Ser Lys Thr Ala Ser Gin Asn Tyr Lys He Pro Ala Ser Gly 
485 490 495 



Gly Asn Ala Leu Leu Lys Tyr Asp Thr His Tyr Thr Leu Asn Asn Arg 
500 505 510 



Trp Ser Asn He Ala Pro Gly Pro Pro Met Ala Thr Ala Gly Pro Ser 
515 520 525 



Asp Gly Asp Phe Ser Asn Ala Gin Leu He Phe Pro Gly Pro Ser Val 
530 535 540 

Thr Gly Asn Thr Thr Thr Ser Ala Asn Asn Leu Leu Phe Thr Ser Glu 
545 550 555 560 

Glu Glu He Ala Ala Thr Asn Pro Arg Asp Thr Asp Met Phe Gly Gin 
565 570 575 



He Ala Asp Asn Asn Gin Asn Ala Thr Thr Ala Pro He Thr Gly Asn 
580 585 590 



Val Thr Ala Met Gly Val Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 
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lie Tyr Tyr Gin Gly Pro lie Trp Ala Lys lie Pro His Ala Asp Gly 
610 615 620 



His Phe His Pro Ser Pro Leu lie Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin He Phe He Lys Asn Thr Pro Val Pro Ala Tyr Pro Ala 
645 650 655 



Thr Thr Phe Thr Ala Ala Arg Val Asp Ser Phe He Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ala Val Gin He Glu Trp Glu He Glu Lys Glu Arg 
675 680 685 



Ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn Cys Gly Asn 
690 695 700 



Gin Ser Ser Met Leu Trp Ala Pro Asp Thr Thr Gly Lys Tyr Thr Glu 
705 710 715 720 



Pro Arg Val He Gly Ser Arg Tyr Leu Thr Asn His Leu 
725 730 



<210> 63 
<211> 734 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV4VP1 
<400> 63 

Met Thr Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser Glu 
15 10 15 



Gly Val Arg Glu Trp Trp Ala Leu Gin Pro Gly Ala Pro Lys Pro Lys 
20 ' 25 30 



Ala Asn Gin Gin His Gin Asp Asn Ala Arg Gly Leu Val Leu Pro Gly 
35 40 45 



Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro Val 
50 55 60 



Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gin 
65 70 75 80 



Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 
85 90 95 



Ala Glu Phe Gin Gin Arg Leu Gin Gly Asp Thr Ser Phe Gly Gly Asn 
100 105 110 
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Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro Leu 
115 120 125 

Gly Leu Val Glu Gin Ala Gly Glu Thr Ala Pro Gly Lys Lys Arg Pro 
130 135 140 

Leu He Glu Ser Pro Gin Gin Pro Asp Ser Ser Thr Gly He Gly Lys 
145 150 155 160 

Lvs Gly Lys Gin Pro Ala Lys Lys Lys Leu Val Phe Glu Asp Glu Thr 
165 " 170 175 

Gly Ala Gly Asp Gly Pro Pro Glu Gly Ser Thr Ser Gly Ala Met Ser 
180 185 190 

Asp Asp Ser Glu Met Arg Ala Ala Ala Gly Gly Ala Ala Val Glu Gly 
195 200 205 

Gly Gin Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys 
210 215 220 

Asp Ser Thr Trp Ser Glu Gly His Val Thr Thr Thr Ser Thr Arg Thr 
225 ^ 230 235 240 

Trp Val Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Arg Leu Gly Glu 
245 250 255 

Ser Leu Gin Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 
260 265 270 

Phe Asp Phe Asn Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin 
275 280 285 

Arg Leu He Asn Asn Asn Trp Gly Met Arg Pro Lys Ala Met Arg Val 
290 295 300 

Lys He Phe Asn He Gin Val Lys Glu Val Thr Thr Ser Asn Gly Glu 
305 310 " 315 320 

Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin He Phe Ala Asp 
325 330 335 

Ser Ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly Ser 
340 345 350 



Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr Gly Tyr 
355 360 365 
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Cys Gly Leu Val Thr Gly Asn Thr Ser Gin Gin Gin Thr Asp Arg Asn 
370 375 380 



Ala Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg Thr Gly 
385 390 395 400 



Asn Asn Phe Glu lie Thr Tyr Ser Phe Glu Lys Val Pro Phe His Ser 
405 410 415 



Met Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro Leu lie 
420 425 430 



Asp Gin Tyr Leu Trp Gly Leu Gin Ser Thr Thr Thr Gly Thr Thr Leu 
435 440 445 



Asn Ala Gly Thr Ala Thr Thr Asn Phe Thr Lys Leu Arg Pro Thr Asn 
450 455 460 



Phe Ser Asn Phe Lys Lys Asn Trp Leu Pro Gly Pro Ser lie Lys Gin 
465 470 475 480 



Gin Gly Phe Ser Lys Thr Ala Asn Gin Asn Tyr Lys lie Pro Ala Thr 
485 490 495 



Gly Ser Asp Ser Leu lie Lys Tyr Glu Thr His Ser Thr Leu Asp Gly 
500 505 510 



Arg Trp Ser Ala Leu Thr Pro Gly Pro Pro Met Ala Thr Ala Gly Pro 
515 520 525 



Ala Asp Ser Lys Phe Ser Asn Ser Gin Leu lie Phe Ala Gly Pro Lys 
530 535 540 



Gin Asn Gly Asn Thr Ala Thr Val Pro Gly Thr Leu lie Phe Thr Ser 
545 550 555 560 



Glu Glu Glu Leu Ala Ala Thr Asn Ala Thr Asp Thr Asp Met Trp Gly 
565 570 575 



Asn Leu Pro Gly Gly Asp Gin Ser Asn Ser Asn Leu Pro Thr Val Asp 
580 585 590 



Arg Leu Thr Ala Leu Gly Ala Val Pro Gly Met Val Trp Gin Asn Arg 
595 600 605 



Asp lie Tyr Tyr Gin Gly Pro lie Trp Ala Lys lie Pro His Thr Asp 
610 615 620 
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Gly His Phe His Pro Ser Pro Leu He Gly Gly Phe Gly Leu Lys His 
625 630 635 640 

Pro Pro Pro Gin He Phe He Lys Asn Thr Pro Val Pro Ala Asn Pro 
645 650 655 

Ala Thr Thr Phe Ser Ser Thr Pro Val Asn Ser Phe He Thr Gin Tyr 
660 665 670 

Ser Thr Gly Gin Val Ser Val Gin He Asp Trp Glu He Gin Lys Glu 
675 680 685 

Arg Ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn Tyr Gly 
690 695 700 

Gin Gin Asn Ser Leu Leu Trp Ala Pro Asp Ala Ala Gly Lys Tyr Thr 
705 710 715 720 

Glu Pro Arg Ala He Gly Thr Arg Tyr Leu Thr His His Leu 
725 730 

<210> 64 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV1 
<400> 64 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 
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Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
210 215 220 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
260 265 270 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
275 280 285 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin 
305 1 310 315 320 

Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
325 330 335 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
340 345 350 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
355 " 360 365 

Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
370 375 380 
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Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 

405 410 415 



Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
420 425 430 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 
435 440 445 



Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 
450 455 460 



Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 ~ 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
485 490 495 



Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
500 505 510 



Gly Arg Glu Ser lie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met lie Phe Gly 

530 535 540 

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met He 

545 550 555 560 



Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
565 " 570 575 



Phe Gly Thr Val Ala Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala 
580 585 590 



Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 

Asp Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 ~ 630 635 640 
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Lys Asn Pro Pro Pro Gin He Leu lie Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 
690 ' 695 700 



Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 ' 710 715 720 



Tyr Thr Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 

<210> 65 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV 6 VP 1 
<400> 65 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Phe Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 
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Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
210 215 220 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 

225 " 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
260 265 270 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
275 280 285 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
290 295 300 

Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin 

305 310 315 320 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
325 330 335 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
340 345 350 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
355 360 365 



Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
370 375 380 



Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 390 395 400 
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Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
405 410 415 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
420 425 430 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 
435 440 445 



Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 
450 455 460 



Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
485 490 495 



Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
500 505 510 



Gly Arg Glu Ser He He Asn Pro Gly Thr Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Lys Asp Lys Phe Phe Pro Met Ser Gly Val Met He Phe Gly 
530 535 540 

Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met He 

545 550 555 560 



Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
565 570 575 



Phe Gly Thr Val Ala Val Asn Leu Gin Ser Ser Ser Thr Asp Pro Ala 
580 585 590 

Thr Gly Asp Val His Val Met Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 



Asp Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 ~ 615 620 



Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 

625 630 635 640 

Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 

645 650 655 
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Asn Pro Pro Ala Glu Phe Ser Ala Thx Lys Phe Ala Ser Phe lie Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Ala Lys Ser Ala Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu 
705 ~ 710 715 720 



Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 



<210> 66 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone A3. 3 
<400> 66 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 



Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 4 0 4 5 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 
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Pro lie Glu Gin Ser Pro Ala Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Thr Glu Ser Val Pro Gly Pro Gin Pro He Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val He 
225 " 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Glu Ser Gly Ala Thr Asn Asp Asn His Tyr 
2 60 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Ala Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 
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Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 " 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ser Lys Thr 
435 440 445 

Gin Gly Thr Ser Gly Thr Thr Gin Gin Ser Arg Leu Gin Phe Ser Gin 
450 455 460 

Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Ser Tyr Arg Gin Gin Arg Met Ser Lys Thr Ala Asn Asp Asn Asn 
485 490 495 



Asn Ser Glu Phe Ala Trp Thr Ala Ala Thr Lys Tyr Tyr Leu Asn Gly 
500 505 510 



Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Val Ala Ser His Lys Asp 
515 520 525 

Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu lie Phe Gly Lys 
530 ~ 535 540 

Gin Gly Thr Gly Thr Thr Asn Val Asp lie Glu Ser Val Leu lie Thr 
545 550 555 560 

Asp Glu Glu Glu He Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Gin Val Ala Thr Asn His Gin Ser Gin Asn Thr Thr Ala Ser Tyr 
580 585 590 



Gly Ser Val Asp Ser Gin Gly He Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 

Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys Thr " Pro His Thr 
610 615 620 

Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 

His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 
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Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe lie Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin Lys 
675 ^ 680 685 

Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 

Asn Lys Ser Val Asn Val Glu Phe Thr Val Asp Ala Asn Gly Val Tyr 
705 710 715 720 



Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 67 

<211> 735 

<212> PRT 

<213> capsid protein of AAV serotype, clone A3. 7 

<400> 67 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1 5 10 15 

Glu Gly He Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 

Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
3 5 40 4 5 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Ile'Glu Gin Ser Pro Ala Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 
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Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 17 0 17 5 



Gly Asp Thr Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn Arg Leu 
245 250 255 



Tyr Lys Gin lie Ser Ser Glu Ser Gly Ala Thr Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie 'Asn Asn Asn Trp 

290 295 300 

Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn lie Gin Val 

305 * 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr lie Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 
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ASP val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
P 420 425 4JU 

L eu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Lys Thr 



435 



Gin Gly Thr Ser Gly Thr Thr Gin Gin Ser Arg Leu Gin Phe Ser Gin 
4 50 455 

Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 4/5 

Pro Ser Tyr Arg Gin Gin Arg Met Ser Lys Thr Ala Asn Asp Asn Asn 
485 490 

Asn Ser Glu Phe Ala Trp Thr Ala Ala Thr Lys Tyr Tyr Leu Asn Gly 
500 505 

Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Met Ala Ser His Lys Asp 
515 520 

Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu He Phe Gly Lys 
530 535 



Gin Gly Thr Gly Thr Thr Asn Val Asp He Glu Ser Val Leu He Thr 
545 550 

Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 

Gly Gin Val Ala Thr Asn His Gin Ser Gin Asn Thr Thr Ala Ser Tyr 
580 585 

Gly Ser Val Asp Ser Gin Gly lie Leu Pro Gly Met Val Trp Gin Asp 
555 

Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys Thr Pro His Thr 
610 615 

Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu ly. 
625 630 

His Pro Pro Pro Gin He Leu He Lys Ajn Thr Pro Val Pro Ala Asn 

Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe He Thr Gin 
660 665 
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Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin Lys 
675 680 685 

Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 

Asn Lys Ser Val Asn Val Glu Phe Thr Val Asp Ala Asn Gly Val Tyr 
705 710 715 720 

Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 68 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone A3. 4 
<400> 68 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 

Glu Gly He Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 

Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 1 ^ 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro He Glu Gin Ser Pro Ala Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 

Glu Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 
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Gly Asp Thr Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 ~ 200 205 



Ala Pro Met Ala Asp Asp Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val He 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Glu Ser Gly Ala Thr Asn Asp Asn His Tyr 
2 60 2 65 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp 

290 295 300 

Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 ~ 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr He Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 

385 390 395 400 

Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 

405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 
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Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Lys Thr 
435 440 445 

Gin Gly Thr Ser Gly Thr Thr Gin Gin Ser Arg Leu Gin Phe Ser Gin 
450 " 455 460 

Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 475 480 

Pro Ser Tyr Arg Gin Gin Arg Met Ser .Lys Thr Ala Asn Asp Asn Asn 
485 490 495 



Asn Ser Glu Phe Ala Trp Thr Ala Ala Thr Lys Tyr Tyr Leu Asn Gly 
500 505 510 

Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Met Ala Ser His Lys Asp 
515 520 525 

Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu He Phe Gly Lys 
530 " 535 540 

Gin Gly Thr Gly Thr Thr Asn Val Asp He Glu Ser Val Leu He Thr 
545 550 555 560 

Asp Glu Glu Glu He Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Gin Val Ala Thr Asn His Gin Ser Gin Asp Thr Thr Ala Ser Tyr 
580 585 590 



Gly Ser Val Asp Ser Gin Gly He Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 

Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys Thr Pro His Thr 
610 615 620 

Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 

His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 

Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 

Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 680 685 
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Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 



Asn Lys Ser Val Asn Val Glu Phe Thr Val Asp Ala Asn Gly Val Tyr 
705 710 715 720 



Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 " 730 735 

<210> 69 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone A3 . 5 
<400> 69 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1 5 10 15 



Glu Gly He Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Asn Gin Gin His Arg Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



His Gin Leu Lys Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Ala Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro He Glu Gin Ser Pro Ala Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 155 160 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Thr Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
180 185 190 
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Ala Ala Pro Ser Gly Val Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 

210 215 220 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val lie 
225 230 235 240 

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Ser Glu Ser Gly Ala Thr Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp 
290 295 300 

Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn lie Gin Val 
305 310 315 320 

Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr lie Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 

Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 

Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ser Lys Thr 
435 440 445 
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Gin Gly Thr Ser Gly Thr Thr Gin Gin Ser Arg Leu Gin Phe Asn Gin 
450 455 460 



Ala Gly Pro Ser Ser Met Ala Gin Gin Ala Lys Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Ser Tyr Arg Gin Gin Arg Met Ser Lys Thr Ala Asn Asp Asn Asn 
485 490 495 



Asn Ser Glu Phe Ala Trp Thr Ala Ala Thr Lys Tyr Tyr Pro Asn Gly 
500 505 510 



Arg Asn Ser Leu Val Asn Pro Gly Pro Pro Met Ala Ser His Lys Asp 
515 520 525 



Asp Glu Glu Lys Tyr Phe Pro Met His Gly Asn Leu lie Phe Gly Lys 
530 ** 535 540 



Gin Gly Thr Gly Thr Thr Asn Val Asp lie Glu Ser Val Leu lie Thr 
545 550 555 560 



Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Gin Val Ala Thr Asn Arg Gin Ser Gin Asn Thr Thr Ala Ser Tyr 
580 585 590 



Gly Ser Val Asp Ser Gin Gly He Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys Thr Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Ala Thr Thr Phe Thr Pro Gly Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 
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Asn Lys Ser Val Asn Val Glu Phe Thr Val Asp Ala Asn Gly Val Tyr 
705 710 715 720 

Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr leu Thr Arg Asn Leu 
725 730 735 



<210> 70 
<211> ' 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV2 
<400> 70 

•Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
15 10 15 



Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 

Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 ~ ^ 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Arg Gin Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145 150 155 160 

Lys Ala Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ala Asp Ser Val Pro Asp Pro Gin Pro Leu Gly Gin Pro Pro 
180 185 190 
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Ala Ala Pro Ser Gly Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val He 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Gin Ser Gly Ala Ser Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin Val 
305 ~ 310 315 320 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr lie Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys .Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr 
435 440 445' 
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Asn Thr Pro Ser Gly Thr Thr Thr Gin Ser Arg Leu Gin Phe Ser Gin 
450 455 460 



Ala Gly Ala Ser Asp lie Arg Asp Gin Ser Arg Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 
485 490 495 



Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
500 505 510 



Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
515 520 525 



Asp Glu Glu Lys Phe Phe Pro Gin Ser Gly Val Leu lie Phe Gly Lys 
530 535 540 



Gin Gly Ser Glu Lys Thr Asn Val Asp lie Glu Lys Val Met He Thr 
545 550 555 560 



Asp Glu Glu Glu He Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Ser Val Ser Thr Asn Leu Gin Arg Gly Asn Arg Gin Ala Ala Thr 
580 585 590 



Ala Asp Val Asn Thr Gin Gly Val Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 ~ 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 
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Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705 710 715 720 



Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
• 725 730 735 

<210> 71 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV 3 
<400> 71 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gin Pro 
20 25 30 



Lys Ala Asn Gin Gin His Gin Asp Asn Arg Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
50 ^ 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg lie Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Gly 
130 135 140 

Ala Val Asp Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly Val Gly 
145 ~ 150 155 160 

Lys Ser Gly Lys Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin Thr 
165 - 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 
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Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Gin Trp Leu Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 ~ 250 255 



Tyr Lys Gin lie Ser Ser Gin Ser Gly Ala Ser Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Ser Phe Lys Leu Phe Asn lie Gin Val 
305 " 310 315 320 



Arg Gly Val Thr Gin Asn Asp Gly Thr Thr Thr lie Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg Thr 
435 440 445 



Gin Gly Thr Thr Ser Gly Thr Thr Asn Gin Ser Arg Leu Leu Phe Ser 
450 455 460 
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Gin Ala Gly Pro Gin Ser Met Ser Leu Gin Ala Arg Asn Trp Leu Pro 
465 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg Leu Ser Lys Thr Ala Asn Asp Asn 
485 490 495 



Asn Asn Ser Asn Phe Pro Trp Thr Ala Ala Ser Lys Tyr His Leu Asn 
500 505 510 

Gly Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys 

515 520 525 



Asp Asp Glu Glu Lys Phe Phe Pro Met His Gly Asn Leu lie Phe Gly 
530 " 535 540 



Lys Glu Gly Thr Thr Ala Ser Asn Ala Glu Leu Asp Asn Val Met lie 
545 " 550 555 560 



Thr Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin 
565 570 575 



Tyr Gly Thr Val Ala Asn Asn Leu Gin Ser Ser Asn Thr Ala Pro Thr 
580 585 590 



Thr Gly Thr Val Asn His Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 



Asp Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
610 615 620 



Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 

625 ~ 630 635 640 

Lys His Pro Pro Pro Gin lie Met lie Lys Asn Thr Pro Val Pro Ala 

645 650 655 



Asn Pro Pro Thr Thr Phe Ser Pro Ala Lys Phe Ala Ser Phe lie Thr 
660 665 670 

Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
675 680 685 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn 
690 695 700 

Tyr Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val 
705 710 715 720 
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Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 72 

<211> 737 

<212> PRT 

<213> capsid protein of AAV serotype, clone 3.3bVPl 

<400> 72 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Asn Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 ~ 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 
0 130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 ,155 160 

Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 
180 185 190 



Pro Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly 
195 200 205 
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Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
210 215 220 

Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 

lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Glu Gin He Ser Ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 
260 265 270 

Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 ' 280 285 

Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn 
290 295 300 

Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn He 
305 " 310 315 320 

Gin Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr He Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 

Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 

Gly Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 

Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser 
405 410 415 

Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 

Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala 
435 440 445 

Arg Thr Gin Ser Asp Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin 
450 455 460 
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Phe Tyr Gin Gly Gly Pro Ser Thr Met Ala Glu Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie 
530 " 535 540 



Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn Val Leu 
545 " 550 555 560 



Met Thr Asn Glu Glu Glu He Arg Pro Thr Asn Pro Val Ala Thr Glu 
565 570 575 



Glu Tyr Gly He Val Ser Ser Asn Leu Gin Ala Ala Asn Thr Ala Ala 
580 585. 590 



Gin Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp 
595 600 605 



Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro 
610 615 620 



His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625 630 635 640 



Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro 
645 650 655 



Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He 
660 665 670 



Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu 
675 680 685 



Gin Lys Glu Asn Ser Lys Arg Trp Asp Pro Glu He Gin Tyr Thr Ser 
690 695 700 



Asn Phe Glu Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly 
705 ~ 710 715 720 
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Val Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn 
725 730 735 



Leu 



<210> 73 
<211> 644 
<212> PRT 

<213> capsid protein of AAV serotype, clone 223-4 
<400> 73 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
15 10 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 



Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 

Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 



Gly Asp Ser Glu Pro Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
100 105 110 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 ^ 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Arg Leu Gly Asp Arg Val lie 
145 ~ 150 155 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin lie Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 
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His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
210 215 220. 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Gly 
340 345 350 



Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu He Phe 
450 455 460 
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Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 

Thr Asn Glu Glu Glu He Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 

Tyr Gly He Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 

Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
530 ^ 535 540 

Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 ' 550 555 560 

Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
565 57 0 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr 
580 585 590 

Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 

Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 



Tyr Ser Glu Pro 



<210> 74 

<211> 644 

<212> PRT 

<213> capsid protein of AAV serotype, clone 223.5 

<400> 74 

Lvs Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
1 5 10 15 

Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 

Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 
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Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 ^70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
8 5 90 95 



Gly Asp Ser Glu Pro Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro 
100 105 110 



Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Arg Leu Gly Asp Arg Val lie' 
145 ~ 150 155 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin lie Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 
210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn lie Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 
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Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 ' 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Gly 
340 345 350 



Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 ~ 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu He Phe 
450 455 460 



Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu He Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly He Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
530 535 540 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 
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Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe lie Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 



Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 " 630 635 640 



Tyr Ser Glu Pro 



<210> 75 

<211> 644 

<212> PRT 

<213> capsid protein of AAV serotype, clone 223.10 
<220> 

<221> MIS COFEATURE 

<222> (434) . . (434) 

<22 3> can be any amino acid 

<400> 75 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
15 10 15 



Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 



Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 



Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 



Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly He Gly 
65 70 75 80 



Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
100 105 110 
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Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 " 140 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
145 150 155 " 160 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 



Tyr Lys Gin He Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
2 90 2 95 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 " 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
340 345 350 



Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



150 



WO 03/042397 



PCT/US02/33629 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 * 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 * 425 ^ 430 



Asn Xaa Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe 
450 455 460 



Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly lie Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
530 535 540 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
565 570 575 



Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr 
580 585 590 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 
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Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 

Tyr Ser Glu Pro 



<210> 76 

<211> 644 

<212> PRT 

<213> capsid protein of AAV serotype, clone 223.2 

<400> 76 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
1 5 io 15 

Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Cys Leu Gin Glu Asp Thr 
20 25 ' 30 

Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 

Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 

Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly He Gly 
65 70 75 80 

Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 go 95 

Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
100 105 HO 

Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Val Ala Gly Gly Gly 
115 120 125 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
145 150 155 "* " 160 

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 

Tyr Lys Gin He Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 
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His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn 
210 215 220 



Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn lie Gin 
225 230 235 240 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn 
245 250 255 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 



Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
340 345 350 



Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 "* 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 ^ 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Ser Pro Ser Ser Gly Val Leu He Phe 
450 455 460 
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Glv Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu He Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 

Tvr Glv He Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
y " J " 5Q0 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 

Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
530 535 540 

Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 

Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
565 570 575 

Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr 
580 585 590 

Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 

Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 &4U 



Tyr Ser Glu Pro 

<210> 77 

<211> 644 

<212> PRT _ 

<213> capsid protein of AAV serotype, clone 223.7 

<400> 77 

Lys Ala Tyr Asp Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg 
X 5 10 

Tyr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 
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Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 

Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 

Gly Lys Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly lie Gly 
65 * " 70 75 80 

Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 

Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
100 105 110 

Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
145 "* 150 155 160 

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 

Tyr Lys Gin He Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 

His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
210 215 220 

Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin 
225 ^ 230 235 240 

Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 

Leu Thr Ser Thr Val Gin Val Phe Ser Asp Pro Glu Tyr Gin Leu Pro 
260 265 270 

Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 
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Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 



Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 



Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
340 345 350 



Arg » Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 365 



Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 



Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 
385 390 395 ~ 400 



Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 430 



Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 



Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu He Phe 
450 455 460 



Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 



Thr Asn Glu Glu Glu He Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 



Tyr Gly He Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 



Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 " 525 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
530 535 540 
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Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 

Lvs His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
565 570 575 

Asn Pro Pro Glu Val Phe Thr Pro Ala Lys He Ala Ser Phe He Thr 
580 585 590 

Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 

Lys Glu Asn Ser lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 615 620 

Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 



Tyr Ser Glu Pro 

<210> 78 

<211> 644 

<212> PRT 

<213> capsid protein of AAV serotype, clone 223.6 

<400> 78 

Lvs Ala Tyr Asp Gin Gin Leu lys Ala Gly Asp Asn Pro Tyr Leu Arg 
1 5 10 15 

Tvr Asn His Ala Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr 
20 25 30 

Ser Phe Gly Gly Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg 
35 40 45 

Val Leu Glu Pro Leu Gly Leu Val Glu Thr Pro Ala Lys Thr Ala Pro 
50 55 60 

Glv Lvs Lys Arg Pro Val Asp Ser Pro Asp Ser Thr Ser Gly He Gly 
65 70 75 80 

Lvs Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
85 90 95 

Glv Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro 
100 105 HO 

Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly 
115 120 125 
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Ala Pro Met Ala Asp Asn Ser Glu Gly Ala Asp Gly Val Gly Asn Ala 
130 135 140 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
145 150 155 160 

Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
165 170 175 

Tyr Lys Gin He Ser Ser Gin Ser Ala Gly Ser Thr Asn Asp Asn Val 
180 185 190 

Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
195 200 205 

His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
210 215 220 

Trp Gly Phe Arg Pro Lys Lys Leu Asn Phe Lys Leu Phe Asn He Gin 
225 ~ 230 235 240 

Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
245 250 255 

Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
260 265 270 

Tvr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
275 280 285 

Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
290 295 300 

Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
305 310 315 320 

Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
325 330 335 

Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
340 345 350 

Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg 
355 360 3 65 

Thr Gin Ser Asn Ala Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin Phe 
370 375 380 
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Tyr Gin Gly Gly Pro Thr Thr Met Ala Glu Gin Ala Lys Asn Trp Leu 

385 ' ~ 390 395 400 

Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp Gin 
405 410 415 



Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu 
420 425 430 

Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His 
435 440 445 

Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe 
450 455 460 

' Gly Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met 
465 470 475 480 

Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu Glu 
485 490 495 

Tyr Gly lie Val Ser Ser Asn Leu Gin Ala Ala Ser Thr Ala Ala Gin 
500 505 510 

Thr Gin Val Val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
515 520 525 

Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
530 535 540 

Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
545 550 555 560 

Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
565 570 575 

Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Leu Ala Ser Phe He Thr 
580 585 590 

Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
595 600 605 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
610 * 615 620 

Phe Asp Lys Gin Thr Gly Val Asp Phe Ala Val Asp Ser Gin Gly Val 
625 630 635 640 
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Tyr Ser Glu Pro 



<210> 79 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 44.1 
<400> 79 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 ~ 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 155 160 

Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 

Thr Gly Asp ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 

Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 
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Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 " 230 235 240 

lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 

Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 * " 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 " 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 4 60 
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Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys He 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 " 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 
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Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
Y 725 730 735 



Asn Leu 



<210> 80 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 44.5 

<400> 80 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 ~ ~ 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 

Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 17 0 17 5 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 

Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 
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Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 

He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 

Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Ara Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 . 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Pro Asn Phe Lys Leu Phe Asn 
305 * 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 ' 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 
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Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 

Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 

His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 

Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 ~ 555 560 

Met Leu Thr Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 

Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 

Pro He Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 

Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 ~ 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 J 630 635 640 

Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 655 

Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 

He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 

Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 

Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 
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Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 81 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 44.2 

<400> 81 , 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

1 5 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 ~ 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 . 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 

Gly Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 

Thr Glv Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 

Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 
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Glv Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
* 210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 " 230 235 240 

lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 

Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 ~ 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 ' 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 
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Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 

Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 

Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 

His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 

Phe Gly lys Gin Gly Ala Gly lys Asp Asn Val Asp Tyr Ser Ser Val 
545 * ~ 550 555 560 

Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 

Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 

Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 

Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 

Glv Leu Lys His Pro Pro Pro Gin lie. Leu lie Lys Asn Thr Pro Val 
645 650 655 

Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 

lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 

Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 

Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 ' * 710 715 720 
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Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 82 

<211> 738 

<212^ PRT 

<213> capsid protein of AAV serotype, clone 29.3VP1 

<400> 82 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly He Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " = 5 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 9° 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro 
145 



Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Thr Thr Gly lie 
150 155 160 



Glv Lys Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
Y 165 170 175 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 I 85 I 90 

pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 
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Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 

He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 

Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 " 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala Arg Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 ~ 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin' Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 
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Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 " 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 * 535 540 

Phe Gly Lys Gin Gly Ala" Gly Lys Gly Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 

Met Leu Thr Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro He Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 

Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 

He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 

Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 

Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 ** " 710 715 720 
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Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 83 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 29.5VP1 

<400> 83 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly He Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 "* 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 155 160 

Glv Lvs Lys Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 
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Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Gly Val 
225 " 230 235 240 



He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 . 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Ser Leu Asn Phe Lys Leu Phe Asn 
305 "* 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys . Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 
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Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 

Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 4 90 495 

Gin Asn Asp Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 

Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 

His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 ~ 535 540 

Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 

Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 

Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 

Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 

Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 " 630 635 640 

Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 

He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 

Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 
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Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Asp 
705 710 715 720 

Glv Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 84 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 42.15 

<400> 84 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 . 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 ~ 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 155 160 

Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 
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Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 

225 230 . 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 " 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Pro Pro Pro Phe 

355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 ^ 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Arg Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 
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Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 



Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 ~ * 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 



lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 ~ * 680 685 



Leu Gln^Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 
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Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 ~ 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 85 
<211> 738 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.8 
<400> 85 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 2 5 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 
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Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 



• Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 ' 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 ~ 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 
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Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
„rX 4 55 4 60 



450 



Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 

Am 475 flou 

Arg Val- Ser Thr Thr Leu 
485 490 495 



465 470 475 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val' Ser Thr Thr Leu Ser 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 iJ - 

Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 

His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 540 

Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 

Met Leu Thr Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr 
565 570 3/0 

Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 39° 

Pro He Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 

Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 



625 



Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
645 650 b3 ° 

Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 

He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 
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Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 86 
<211> 733 
<212> PRT 

<213> amino acid of AAV serotype, clone 42.13 
<400> 86 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro He Glu Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 

Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 
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Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 

GlY Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 ~ 200 205 

ASP Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp 
210 215 220 

His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 ~ 230 235 240 

Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 

Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn Thr Tyr Phe Gly 
260 265 270 

Tvr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
275 280 285 

Phe ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp Gly Phe 
290 295 300 

Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin Val Lys Glu 
305 " 310 315 -320 

Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala Asn Asn Leu Thr Ser 
325 330 335 

Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
340 345 350 

Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
355 360 365 

Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
370 " 375 380 

Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
385 390 395 400 

Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val 
405 410 415 

pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
420 425 430 
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Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser 
435 440 445 

Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly 
450 " 455 460 

Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys 
465 470 475 480 

Tyr Arg Gin Gin Arg Val Ser Thr Thr Val Ser Gin Asn Asn Asn Ser 
J 485 490 495 

Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp 
500 505 510 

Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His Lys Gly Asp Glu 
515 520 525 

Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly 
530 535 540 

Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu 
545 550 555 

Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val 
565 570 S' 3 

Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro He Val Gly Ala 
580 585 590 

Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 &05 

Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly 
610 * 615 620 

Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 

Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro 
645 650 655 

Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe He Thr Gin Tyr Ser 
660 665 6/0 

Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn 
675 680 685 
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Ser Lvs Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr Tyr Lys 
690 ~ 695 700 

Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 

710 715 720 



705 



Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Ser Leu 
725 730 



<210> 87 

<211> 733 

< 2 1 2 > PRT 

<213> capsid protein of AAV serotype, clone 42. 3A 

Met°Ala 8 Ala Asp Gly His Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 ^ 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro He Glu Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 

Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 1"5 

Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro Ala Gly Pro Ser 
180 I 85 190 
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Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 

Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp 
210 215 220 

His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 " 230 235 240 

Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 

Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn Thr Tyr Phe Gly 
260 265 270 

Tvr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
275 ' 280 285 

Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Ser Trp Gly Phe 
290 295 300 

Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin Val Lys Glu 
305 310 315 320 

Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala Asn Asn Leu Thr Ser 
325 330 335 

Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
340 345 350 

Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
355 360 365 

Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
370 375 380 



Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
385 390 395 400 

Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val 
405 410 415 

Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
420 " 425 430 

Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser 
435 440 445 
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Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly 
4 50 455 4 60 



Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys 
465 470 475 480 



Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser Gin Asn Asn Asn Ser 
485 490 495 



Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp 
500 505 510 



Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His Lys Asp Asp Glu 
515 520 525 



Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly 
530 535 540 



Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu 
545 " 550 555 560 



Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val 
565 570 575 



Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro lie Val Gly Ala 
580 585 590 



Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 



Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly 
610 615 620 



Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 



Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala Asp Pro Pro 
645 650 655 



Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe lie Thr Gin Tyr Ser 
660 665 670 



Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn 
675 680 685 



Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr Tyr Lys 
690 695 700 
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Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 
705 710 715 720 

Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 



<210> 88 

<211> 731 

<212> PRT 

<213> capsid protein of AAV serotype, clone 42.4 

<400> 88 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 ~ ' 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 " 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 'HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro He Glu Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 

Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro Ala Gly Pro Ser 
180 1B5 190 
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Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 *" 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Ser Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin Val Lys Glu Val Thr 
305 310 315 320 



Gin Asn Glu Gly Thr Lys Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Thr Asp Ser Glu Tyr Arg Leu Pro Tyr Val Leu Gly Ser 
340 ~ 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser Thr Gly 
435 440 445 
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Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly Pro Asn 
450 455 460 



Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg 
465 470 475 480 



Gin Gin Arg Val Ser Thr Thr Leu Ser Gin Asn Asn Asn Ser Asn Phe 
485 490 495 



Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp Ser Leu 
500 505 510 



Val Asn Pro Gly Val Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg 
515 520 525 



Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly Ala Gly 
530 535 540 



Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu Glu Glu 
545 550 555 560 



lie Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val Val Ala 
565 570 575 



Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro lie Val Gly Ala Val Asn 
580 585 590 



Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr 
595 600 605 



Leu Gin Gly Pro lie Trp Ala Lys He Pro His Thr Asp Gly Asn Phe 
610 615 620 



His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro 
625 630 635 640 



Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr 
645 650 ■ 655 



Phe Ser Gin Ala Lys Pro Ala Ser Phe He Thr Gin Tyr Ser Thr Gly 
660 665 670 



Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys 
675 680 685 



Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr 
690 695 700 
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Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu Pro Arg 
705 710 715 720 



Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 



<210> 89 

<211> 731 

<212> PRT 

<213> capsid protein of AAV serotype, clone 42. 5A 

<400> 89 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
! 5 10 . 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Arg Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 

Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 

Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 

Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 
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Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 ~ 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 

Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 2 55 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 " 280 285 



Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Arg Gly Phe Arg Pro 
290 295 300 

Arg Lys Leu Arg Phe Lys Leu Phe Asn He Gin Val Lys Glu Val Thr 

305 310 315 320 



Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 

Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 " ^ 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Ty* Tyr Leu Ser Arg Thr Gin Ser Thr Gly 
435 440 445 



Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly Pro Asn 
450 "* 455 460 
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Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg 
465 470 475 480 



Gin Gin Arg Val Ser Thr Thr Leu Ser Gin Asn Asn Asn Ser Asn Phe 
485 490 495 



Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp Ser Leu 
500 505 510 



Val Asn Pro Gly Val Ala Met Ala Thr His Lys Asp Asp Glu Glu Arg 
515 520 525 



Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly Ala Gly 
530 535 540 



Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu Glu Glu 
545 550 555 560 



He Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val Val Ala 
565 570 575 



Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro He Val Gly Ala Val Asn 
580 585 590 



Ser Gin Gly Ala Leu Pro Gly Met Ala Trp Gin Asn Arg Asp Val Tyr 
595 600 605 



Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly Asn Phe 
610 615 620 



His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro 
625 630 635 640 



Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Thr 
645 650 655 



Phe Ser Gin Ala Lys Leu Ala Ser Phe He Thr Gin Tyr Ser Thr Gly 
660 665 670 



Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys 
675 680 685 



Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr Tyr Lys Ser Thr 
690 695 700 



Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu Pro Arg 
705 710 715 720 
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Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 



<210> 90 

<211> 733 

<212> PRT 

<213> capsid protein of AAV serotype, clone 42. IB 



Met°Ala 9 Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

10 J- 5 



1 3 

Glu Gly He Arg Glu Trp Trp Asp Leu Arg Pro Gly Ala Pro Lys Pro 
20 25 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
y 35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " 55 6° 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 



70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 

90 93 



85 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 H° 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 I 40 

Pro He Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 

Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 I 75 

Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 I 90 

Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
19 5 " 200 205 

Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser Ser Gly Asn Trp 
210 215 220 
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His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 

Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
24 5 250 2 55 

Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn Thr Tyr Phe Gly 
260 265 270 

Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
275 ' 280 285 

Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe 
290 " 295 300 

Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin Val Lys Glu 
305 ^ " 310 315 320 

Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala Asn Asn Leu Thr Ser 
325 330 335 

Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
340 345 350 

Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
355 360 365 

Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ala 
370 " 375 380 

Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
385 " 390 395 400 

Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Gin Phe Glu Asp Val 
405 410 415 



Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
420 425 430 

Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr Gin Ser 
435 440 445 

Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu Phe Ser Gin Ala Gly 
4 50 455 460 

Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp Leu Pro Gly Pro Cys 
465 470 475 480 
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Tvr Arg Gin Gin Arg Val Ser Thr Thr Val Ser Gin Asn Asn Asn Ser 
485 490 495 

Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asp 
500 505 510 

Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr His Lys Gly Asp Glu 
515 520 525 

Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met Phe Gly Lys Gin Gly 
530 535 540 

Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val Met Leu Thr Ser Glu 
545 * 550 555 560 

Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val 
565 570 575 

Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala Pro He Val Gly Ala 
580 585 590 

Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp 
595 600 605 

Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly 
610 615 620 

Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro 
625 630 635 640 

Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asp Pro Pro 
645 650 655 

Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe He Thr Gin Tyr Ser 
660 665 670 

Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn 
675 680 685 

Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Tyr Lys 
690 " 695 700 

Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 
705 710 715 720 

Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 



195 
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<210> 91 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 42 . 5B 

<400> 91 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Glv Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Lvs Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 155 160 

Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 17 5 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 

Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 

He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 2 50 2 55 
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Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Ala Gly Thr Gin Gin Leu Leu 
450 455 460 

Phe Ser Gin Ala Gly Pro Asn Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 

Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 
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Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 ~ 520 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 " 535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Gin Asn Ala Ala 
580 585 590 



Pro lie Val Gly Ala Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie 
610 615 620 



Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys "Leu Ala Ser Phe 
660 665 " 670 



lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr 
690 695 700 



Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Thr Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 

Asn Leu 



<210> 92 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 43.1 
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<400> 92 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Glv lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

<» 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Glv Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 155 160 

Glv Lvs Lys Gly His Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 

Thr Glv Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 
180 185 190 

Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 

Glv Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 

He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 
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Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 ~ 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Pro Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Gin Gly Thr Gin Gin Leu Leu 
450 455 460 

Phe Ser Gin Ala Gly Pro Ala Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 

Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 

Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 
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His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 
530 535 

Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 

Met Leu Thr Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 

Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Thr Asn Gly Ala 
580 585 590 

Pro He Val Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 

Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys He 
610 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 *™ 

Gly Leu Lys His Pro Pro Pro Gin He Leu Val Lys Asn Thr Pro Val 
645 650 o55 

Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 

He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 

Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 

Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 

Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 ' 



Asn Leu 



<210> 93 

<211> 738 

<212> PRT 

<213> capsid protein of AAV serotype, clone 43. 



201 



WO 03/042397 



PCT/US02/33629 



Me 0 t°Ala"la Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
x 5 10 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
y 35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 



100 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 1 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 



Gly Lys Lys Gly His Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
Thr Gly Asp ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 



180 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 

He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 



245 



250 
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Leu 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 

- --- ~ 2 65 



2 60 



Asn Thr Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 28 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 J1 



Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr lie Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 3 45 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 



355 



360 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 



385 



390 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 

Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 

Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Gin Gly Thr Gin Gin Leu Leu 
450 455 460 

Phe Ser Gin Ala Gly Pro Ala Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 

Lys 

500 505 510 



485 4 90 
Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
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Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Gin Gly Thr Gin Gin Leu Leu 
450 455 460 

Phe Ser Gin Ala Gly Pro Ala Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 480 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Leu Ser 
485 490 495 

Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 510 
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• Le u Asn Gly Arg Asp Ser Leu Val Asn Pro Sly Val Ala Met Ala Thr 
. 515 520 

His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 



530 535 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 



545 



550 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 

Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Thr Asn Gly Ala 
580 585 



Pro lie Val Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 

Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys He 
610 615 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 

Gly Leu Lys His Pro Pro Pro Gin He Leu Val Lys Asn Thr Fro Val 
645 6tlU 

Pro Ala Asp Pro Pro Thr Thr Phe s.c Gin Ala Lys Leu Ala Ser Phe 



660 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 ° J 

Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 

ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp *h. Ala Val Asn Thr Glu 
705 710 

Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 ' JU 



Asn Leu 



<210> 94 
<211> 738 

<213> capsid protein of AAV serotype, clone 43.5 
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MetMa'ila Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 



50 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 

r 

85 . 90 



65 70 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 



Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 11 



Asn Leu 



Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 i4 '- 



115 120 125 

Leu 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 • 150 155 

Gly Lys Lys Gly His Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 

i 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro He Gly Glu Pro 



180 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 2 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 2 15 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 

230 235 

Thr Trp Ala Leu Pro Thr Tyr Asn 
245 " " "0 255 



Gly 



225 

lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
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L eu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
2 60 265 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Ph. Asp Phe Asn 
275 280 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 



290 



295 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 31b 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala 
325 330 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 36:3 



Pro 



Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 39° 3 = '' 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 



Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 

fl £ «3 



420 



Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 

Ser Arg Thr Gin Ser Thr Gly Gly Thr Gin Gly Thr Gin Gin Leu Leu 
450 455 

Phe Ser Gin Ala Gly Pro Ala Asn Met Ser Ala Gin Ala Lys Asn Trp 
465 470 475 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val ser Thr Thr Leu Ser 



485 



Gin Asn Asn Asa Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 Di 
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Leu Asn Gly Arg Asp Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 

Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Ser Gly Val Leu Met 



His 

530 



535 540 



Phe Gly Lys Gin Gly Ala Gly Lys Asp Asn Val Asp Tyr Ser Ser Val 
545 550 555 

Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 3'° 



Glu Gin Tyr Gly Val Val Ala Asp Asn Leu Gin Gin Thr Asn Gly Ala 
580 585 

Pro lie Val Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 

Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys He 
610 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 



625 



630 635 



Gly Leu Lys His Pro Pro Pro Gin He Leu Val Lys Asn Thr Pro Val 
645 650 b:>3 

Pro Ala Asp Pro Pro Thr Thr Phe Ser Gin Ala Lys Leu Ala Ser Phe 
660 665 670 

lie Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 

Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 

Ser Asn Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 

Gly Thr Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 



Asn Leu 



<210> 95 
<211> 738 



<213> capsid protein of AAV serotype, clone AAV8 
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<400> 95 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 

Glu Gly He Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 



35 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

55 °° 



50 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 ™ ^ 5 

Gin Gin Leu Gin Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 i° 5 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 I 55 

Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 

Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 
180 185 

Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 
195 200 205 

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 

ser ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 



225 



230 



208 



WO 03/042397 



PCTAJS02/33629 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 

Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Sly Ala Thr Asn Asp 
260 265 270 

Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 



275 



280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305 310 3 

He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala 
325 330 335 

Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 

Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 

Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 

Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 

Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Thr Tyr 
405 410 

Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 



Ser Arg 
4 50 



Thr Gin Thr Thr Gly Gly Thr Ala Asn Thr Gin Thr Leu Gly 



455 4 6° 



Phe Ser Gin Gly Gly Pro Asn Thr Met Ala Asn Gin Ala Lys Asn Trp 
465 4 ™ 475 480 

Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Gly 
485 490 
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Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 
500 505 

Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly He Ala Met Ala Thr 
515 520 

His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly He Leu He 

535 i)4U 



530 



Phe Gly Lys Gin Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
545 ~ 550 555 

Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 

Glu Glu Tyr Gly He Val Ala Asp Asn Leu Gin Gin Gin Asn Thr Ala 
580 585 SVV 

Pro Gin He Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 



595 



600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 

Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 

Gly Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val 
* 645 650 655 



Pro 



Ala Asp Pro Pro Thr Thr Phe Asn Gin Ser Lys Leu Asn Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 

ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 
705 710 

Gly Val Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
7 25 730 



Asn Leu 



210 
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<210> 96 

<211> 736 

<212> PRT 

<213> capsid protein of AAV serotype, clone 43.^1 

<400> 96 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 " 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 16° 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 

Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 

Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
X 95 200 205 

Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 
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Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
225 ^ 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn 
290 295 300 

Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He 
305 310 315 320 

Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr He Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Arg Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 ~ 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 

Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 

Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 

Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 

Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 480 
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Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Ser 
485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 

Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 

Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 

Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 

Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala He Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val He Pro Gly Met Val Trp Gin 
595 600 605 

> 

Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 

Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 

Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 

Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe He Thr 
660 665 670 

Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 1 680 685 

Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 "* 695 700 

Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 

Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 
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<210> 97 
<211> 736 

<l">> Ssid protein of «W serotype, done ,3.25 

££VL. «p w — »« «» r„ p sl " " p Asn S" s,t 

1 5 

„. «, ii. «g «. ««p »» L " "° Gly Rla 5° Iys Pt ° 

20 " 



Ly3 Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 

Gl y Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 

Sin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 
ASP Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 1Ub 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 



115 



120 



,eu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 

Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly He Gly 
145 150 

Lys Thr Gly Gin Gin Pro Ala Lys Lys Arj Leu Asn Phe Gly Gin Thr 
165 

Sly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 

M. Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 

Ala p ro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 

Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val II. 



225 230 
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Thr Thr .« Thr «, Thr to Ale » Jr. Tyr » » -J- — 

245 ZDU 

m . -„ rnv G iv ser Thr Asn Asp Asn 
Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly ^ 
260 Zb ^ 

Th* Tyr Ph. sly Tyr s.r Thr Pro Trp sly Tyr Ph. »p P»e Asn «, 



275 2»° 



Ph , „i, cy. His Ph. Be. Pro AT, »P Trp Sin «, L.U H- Asn » 



290 



295 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie 
305 310 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr He Ala Asn 
325 3JU 

„ L.U Thr s.r Tor V,! On v.! Ph. Tor Asp Ser Slu Tyr Sin Loo 
340 340 

,„ Tyr VOX L.U Sly Sor Al, U. «- U — J» "° "° 

355 350 

u , M p vol Ph. Mot v.! Pro Sin Tyr Sly Tyr L.U Thr Loo A» Asn 
370 3 7:5 

_ 7\ <^ o er . e~ r Phe Tyr Cys Leu Glu Tyr Phe 
Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe xyr y ^ 

385 390 

,„ S.r Sin «.t Leu Arg Thr «, Asn Asn Phe Gin Phe =« Tyr Thr 
405 * iu 

, , „ pvi o Hi 3 ser Ser Tyr Ala His Ser Gin Ser Leu 
Phe Glu Asp Val Pro Phe His Ser &er y 43Q 

420 

*op AT, Leu Met Asn Pro L.o O. A,p sin Tyr Leu Tyr Tyr L.u v.l 
435 

Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 4513 

«j Alo Sly Pro sor sor Hot Ale Asn Sin Ale Ar, Aon Trp v.! Pro 

Gly Pro eye Tyr Ar, sir, sin Ar, v.! s.r Thr Thr Thr Asn sin Aon 
4 85 
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A sn Asn sex Asn Phe Ala Txp Thx Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 

G1 y A xg Asp Sex Leu Met Asn Pxo Gly Val Ala Met Ala Sex His Lys 
515 520 

Asp Asp Asp Asp Axg Phe Phe Pxo Sex Sex Gly Val Leu He Phe Gly 
530 535 

Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyx Sex Gin Val Leu lie 



545 



550 



Thx Asp Glu Glu Glu He Lys Ala Thx Asn Pxo Val Ala Thx Glu Glu 
565 570 



Tyr Gly Ala Val Ala He Asn Asn Gin Ala Ala Asn Thx Gin Ala Gin 
580 585 

Thr Gly Leu Val His Asn Gin Gly Val He Pxo Gly Met Val Txp Gin 
595 600 

Asn Axg Asp Val Tyx Leu Gin Gly Pxo He Txp Ala Lys He Pxo His 
610 615 

Thx Asp Gly Asn Phe His Pxo Sex Pxo Leu Met Gly Gly Phe Gly Leu 
625 630 

Lys His Pxo Pxo Pxo Gin He Leu He Lys Asn Thx Pxo Val Pro Ala 
645 b0U 

As p Pxo Pxo Leu Thx Phe Asn Gin Ala Lys Leu Asn Sex Ph. He Thx 
660 6 " 

Gin Tyx Sex Thx Gly Gin Val Sex Val Glu He Glu Txp Glu Leu Gin 
675 680 



Lys Glu Asn sex Lys Axg Txp Asn Pxo Glu He Gin Tyx Thx Sex Asn 
690 695 

Tyx Tyx Lys Sex Thx Asn Val Asp Phe Ala Val Asn Thx Glu Gly Val 
705 710 

Tyx sex Glu Pxo Axg Pxo He Gly Thx Axg Tyx Leu Thx Axg Asn Leu 
725 730 
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<210> 98 
<211> 736 
<212> PRT 

<213> capsid protein of AAV serotype, clone 43.23 
<400> 98 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " ~ ' 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 
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Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr lie Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Leu Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Pro Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 490 495 
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Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
5 65 " 57 0 575 



Tyr Gly Ala Val Ala lie Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val lie Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 ~ 630 635 640 



Lys His Pro Pro Pro Gin He Leu lie Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 ~ 710 715 720 



Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 
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<210> 99 

<211> 736 

<212> PRT 

<213> capsid protein of AAV serotype, clone 43.20 

<400> 99 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40. " 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Leu Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 * 190 



Ala Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 
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Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 

Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr He Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Thr Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 490 495 
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Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Asp Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala He Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val He Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly. Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 640 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 



Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 100 

<211> 736 

<212> PRT 

<213> capsid protein of AAV serotype, clone AAV9 
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400> 100 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 " 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 ■ 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 ^ " 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 " 160 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Glu Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He 
225 230 235 " 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 
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Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn 
290 295 300 



Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr He Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 475 " 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 



224 



WO 03/042397 



PCT/US02/33629 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 " 525 



Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 550 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala He Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val He Pro Gly Met Val Trp Gin 
595 600 ^ 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 ~ 640 



Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe He Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 720 



Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 101 

<211> 728 

<212> PRT 

<213> capsid protein of AAV serotype, clone 24.1 

<400> 101 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 is. 
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Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Arg Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn .Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Val Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 "* 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Ser Tyr Ser 
260 265 270 
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Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys I«eu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 " 310 315 320 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala* Asp Val Phe Met lie 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Val His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn lie Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 
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Pro lie Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 " 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu lie Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Cys Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 102 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.2REAL 
<400> 102 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 
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Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
B5 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



229 



WO 03/042397 PCT/US02/33629 

Pro lie Asn Gly Val Leu Val Phe Gly Glu Thr Gly Ala Ala Asn Lys 
530 535 540 

! o 

Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 

Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 

Gin Ser Ser Thr Ala Gly Prg Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 f 585 f 590 

Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu* Gin Gly 
595 ;V 600 605 

Pro He Trp Ala Lys He Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 

Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He Leu 
625 630 635 640 

He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 

* 

Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 67 0 

Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 

Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 

Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 

Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 103 

<211> 728 

<212> PRT 

<213> capsid protein of AAV serotype, clone 7.2VP1 

<400> 103 

Met Ala Ala* Asp Gly Tyr Leu Pro Asp Trp Leu Glu Gly Asn Leu Ser 
1 5 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 
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Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Arg Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
.50 55 60 

Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 

65 70 H <75 80 

Lvs Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Asn Gly Gin 
145 150 155 160 

Pro Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 

Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 

Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 

Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 

His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 

Arq Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 

Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
2 60 2 65 270 

Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 
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Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 " 310 315 320 



Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie 
355 360 3 65 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 ~ 375 380 

Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 

Thr Gly Asp Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 

Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 

485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 
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Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg -Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro He Trp Ala Lys He Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He Leu 
625 630 635 640 



He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 104 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 27.3VP1 
<400> 104 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 
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Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 - 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Ser Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly A3p Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 
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Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn He Gin Val Lys Glu Val Thr 
305 " 310 315 320 



Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Cys Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Val 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Leu 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 
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Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 " 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Arg Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro He Trp Ala Glu He Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He Leu 
625 630 635 640 



He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 " 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 105 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 16.3VP1 
<400> 105 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 
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Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 " 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
• 100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 ~ 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 " 295 300 
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Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 ~ 310 315 320 



Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 * 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Met Gly 
370 ~ 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
4 85 4 90 4 95 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Gly Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 
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Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Gly Val Phe Thr Pro 
645 650 655 



Ala Leu Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro lie Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 106 
<211> 728 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.10 
<400> 106 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



240 



WO 03/042397 



PCT/US02/33629 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys' Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Arg Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
2 45 2 50 " 2 55 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 " 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 
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Arg Lys Leu Arg Phe Lys Leu Phe Asn He Gin Val Lys Glu Val Thr 
305 310 315 320 



Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 " 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 
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Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn S.er Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro lie Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 107 

<211> 728 

<212> PRT 

<213> capsid protein of AAV serotype, clone 42. 3B 

<400> 107 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 " 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 
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Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 7 0 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 '* 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 ' 175 



Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 310 315 320 
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Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr He 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met He 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 " 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin Gin 
465 470 475 480 



Arg Leu Ser Lys Asn He Asp Ser Asn Asn Thr Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 " 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Gin Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 



i 
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Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Leu 
625 630 635 640 



He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 108 

<211> 728 

<212> PRT 

<213> capsid protein of AAV serotype, clone 42.11 

<400> 108 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 
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Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp. Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Glu Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro Pro Ala Gly Pro Ser 
180 185 190 



Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Gin Ser Gly Ala Thr Asn Asp Asn His Phe Phe Gly Tyr Ser 
260 265 270 



Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe Ser 
275 280 285 



Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg Pro 
290 295 300 



Arg Lys Leu Arg Phe Lys Leu Phe Asn lie Gin Val Lys Glu Val Thr 
305 ~ 310 315 320 
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Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr lie 
325 330 335 



Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly Ser 
340 345 350 



Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met lie 
355 360 365 



Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val Gly 
370 375 ' 380 



Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 



Thr Gly Asn Asn Phe Glu Phe Ser Tyr Thr Phe Glu Glu Val Pro Phe 
405 410 415 



His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn Pro 
420 425 430 



Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr Thr 
435 440 445 



Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr Met 
450 455 460 



Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Arg Gin 
465 470 475 480 



Arg Leu Ser Lys Asp lie Asp Ser Asn Asn Asn Ser Asn Phe Ala Trp 
485 490 495 



Thr Gly Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr Asn 
500 505 510 



Pro Gly Val Ala Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe Phe 
515 520 525 



Pro lie Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn Lys 
530 535 540 



Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys Thr 
545 550 555 560 



Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn Leu 
565 570 575 
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Gin Ser Ser Thr Ala Gly Pro Gin Thr Gin Thr Val Asn Ser Gin Gly 
580 585 590 



Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly 
595 600 605 



Pro He Trp Ala Lys He Pro His Thr Asp Gly Asn Phe His Pro Ser 
610 615 620 



Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin He Leu 
625 630 635 640 



He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr Pro 
645 650 655 



Ala Lys Phe Ala Ser Phe He Thr Gin Tyr Ser Thr Gly Gin Val Ser 
660 665 " 670 



Val Glu He Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn 
675 680 685 



Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val Glu 
690 695 700 



Phe Ala Val Asn Asn Glu Gly Val Tyr Thr Glu Pro Arg Pro He Gly 
705 710 715 720 



Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 109 

<211> 729 

<212> PRT 

<213> capsid protein of AAV serotype, clone F1VP1 

<400> 109 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 

Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 
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Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Asp Ser Pro Asp Ser Ser Thr Gly He Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Ser Val Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 *" ~ 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Ser Ser Ser Ser Gly Ala Thr Asn Asp Asn His Tyr Phe Gly Tyr 
260 265 270 



Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe 
275 " 280 285 



Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp Gly Phe Arg 
290 295 300 



Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn He Gin Val Lys Glu Val 
305 310 315 320 



Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr 
325 330 335 
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Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly 
340 345 350 



Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met 
355 360 365 



lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val 
370 375 380 



Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu 
385 390 395 400 



Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser Phe Glu Asp Val Pro 
405 410 415 



Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn 
420 425 430 



Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr 
435 440 445 



Thr Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr 
450 455 460 



Met Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin 
465 470 475 480 



Gin Gly Leu Ser Lys Asn Leu Asp Phe Asn Asn Asn Ser Asn Phe Ala 
485 490 495 



Trp Thr Ala Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr 
500 505 510 



Asn Pro Gly lie Pro Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe 
515 520 525 



Phe Pro lie Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn 
530 535 540 



Lys Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu lie Lys 
545 550 555 560 



Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn 
565 570 575 



Leu Gin Pro Ser Thr Ala Gly Pro Gin Ser Gin Thr lie Asn Ser Gin 
580 585 590 
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Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin 
595 600 605 



Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro 
610 ~ 615 620 



Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie 
625 630 635 640 



Leu lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr 
645 650 655 



Pro Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val 
660 665 670 



Ser Val Glu lie Glu Trp Glu Leu Gin. Lys Glu Asn Ser Lys Arg Trp 
675 " 680 " 685 



Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val 
690 695 700 



Glu Phe Ala Val Asn Pro Asp Gly Val Tyr Thr Glu Pro Arg Pro lie 
705 710 715 720 



Gly Thr Arg Tyr Leu Pro Arg Asn Leu 
725 



<210> 110 
<211> 729 
<212> PRT 

<213> capsid protein of AAV serotype, clone F5VP1@3 
<400> 110 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 
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Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Asp Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 ^ 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Ser Val Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Thr Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val He Thr Thr Ser Thr 
225 ~ 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin He 
245 250 255 



Ser Ser Ser Ser Ser Gly Ala Thr Asn Asp Asn His Tyr Phe Gly Tyr 
260 265 270 



Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe 
275 280 285 



Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn Trp Gly Phe Arg 
290 ~ " " 295 300 



Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn He Gin Val Lys Glu Val 
305 310 315 320 



Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn Leu Thr Ser Thr 
325 330 335 



Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly 
340 345 350 
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Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met 
355 360 365 



He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser Gin Ser Val 
370 *" " 375 380 



Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu 
385 390 395 400 



Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser Phe Glu Asp Val Pro 
405 410 415 



Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn 
420 425 430 



Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr 
435 440 445 



Thr Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr 
450 455 460 



Met Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin 
465 470 475 480 



Gin Arg Leu Ser Lys Asn Leu Asp Phe Asn Asn Asn Ser Asn Phe Ala 
485 490 495 



Trp Thr Ala Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr 
500 505 510 



Asn Pro Gly He Pro Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe 
515 520 525 



Phe Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn 
530 535 540 



Lys Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys 
545 550 555 560 



Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn 
565 570 575 



Leu Gin Ser Ser Thr Ala Gly Pro Gin Ser Gin Thr He Asn Ser Gin 
580 585 590 



Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin 
595 600 605 
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Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro 
610 " 615 620 

Ser Pro Leu Met Gly Gly Phe Gly LeuGlu His Pro Pro Pro Gin lie 
625 630 635 640 

Leu lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr 
645 650 655 



Pro Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val 
660 665 670 



Ser Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp 
675 680 685 



Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val 
690 695 700 



Glu Phe Ala Val Asn Pro Asp Gly Val Tyr Thr Glu Pro Arg Pro He 
705 710 715 720 



Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 111 

<211> 729 

<212> PRT 

<213> capsid protein of AAV serotype, clone F3VP1 

<400> 111 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 
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Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 HO 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 ~ 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro lie Gly Ser Pro Asp Ser Ser Thr Gly lie Gly Lys Lys Gly Gin 
145 150 155 160 



Gin Pro Ala Lys Lys Lys Leu Asn Phe Gly Gin Thr Gly Asp Ser Glu 
165 170 175 



Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Ala Pro Ser 
180 185 190 



Ser Val Gly Ser Gly Thr Met Ala Ala Gly Gly Gly Ala Pro Met Ala 
195 200 205 



Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn Trp 
210 215 220 



His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser Thr 
225 230 235 240 



Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin lie 
245 250 255 



Ser Ser Ser Ser Ser Gly Ala Thr Asn Asp Asn His Tyr Phe Gly Tyr 
260 265 270 



Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe 
275 *" 280 285 



Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe Arg 
290 295 300 



Pro Lys Lys Leu Arg Phe Lys Leu Leu Asn lie Gin Val Lys Glu Val 
305 310 315 320 



Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser Thr 
325 330 335 



Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu Gly 
340 345 350 
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Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe Met 
355 360 365 

lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asp Asn Gly Ser Gin Ser Val 
370 ^ ** 375 380 



Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met Leu 

385 390 395 400 

Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser Phe Glu Asp Val Pro 
405 410 415 



Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met Asn 
420 425 430 



Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala Arg Thr Gin Ser Thr 
435 440 445 

Thr Gly Ser Thr Arg Glu Leu Gin Phe His Gin Ala Gly Pro Asn Thr 
450 455 460 



Met Ala Glu Gin Ser Lys Asn Trp Leu Pro Gly Pro Cys Tyr Arg Gin 
465 470 475 480 



Gin Arg Leu Ser Lys Asn Leu Asp Phe Asn Asn Asn Ser Asn Phe Ala 
485 490 495 



Trp Thr Ala Ala Thr Lys Tyr His Leu Asn Gly Arg Asn Ser Leu Thr 
500 505 510 

Asn Pro Gly He Pro Met Ala Thr Asn Lys Asp Asp Glu Asp Gin Phe 
515 520 525 



Phe Pro He Asn Gly Val Leu Val Phe Gly Lys Thr Gly Ala Ala Asn 
530 535 540 



Lys Thr Thr Leu Glu Asn Val Leu Met Thr Ser Glu Glu Glu He Lys 
545 550 555 560 



Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr Gly Val Val Ser Ser Asn 
565 570 575 



Leu Gin Ser Ser Thr Ala Gly Pro Gin Ser Gin Thr He Asn Ser Gin 
580 585 590 



Gly Ala Leu Pro Gly Met Val Trp Gin Asn Arg Asp Val Tyr Leu Gin 
595 600 605 
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Gly Pro lie Trp Ala Lys lie Pro His Thr Asp Gly Asn Phe His Pro 

610 615 620 

Ser Pro Leu Met Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie 

625 630 635 640 



Leu lie Lys Asn Thr Pro Val Pro Ala Asn Pro Pro Glu Val Phe Thr 
645 650 655 



Pro Ala Lys Phe Ala Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val 
660 665 670 



Ser Val Glu lie Glu Trp Glu Leu Gin Lys Glu Asn Ser Lys Arg Trp 
675 680 685 



Asn Pro Glu lie Gin Tyr Thr Ser Asn Tyr Ala Lys Ser Asn Asn Val 
690 695 700 



Glu Phe Ala Val Asn Pro Asp Gly Val Tyr Thr Glu Pro Arg Pro lie 
705 710 715 720 



Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 



<210> 112 
<211> 735 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42. 6B 
<400> 112 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 
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Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 

Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 

He Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin He Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 

Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 

Asn Asn Trp Gly Phe Arg Pro Arg Lys Leu Arg Phe Lys Leu Phe Asn 

305 ~ 310 315 320 



He Gin Val Lys Glu Val Thr Thr Asp Asp Gly Val Thr Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Ser Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 365 
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Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 

Asn Gly Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 

Thr Phe Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 

Leu Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 

Ala Arg Thr Gin Ser Thr Thr Gly Ser Thr Arg Glu Leu Gin Phe His 
450 455 460 



Gin Ala Gly Pro Asn Thr Met Ala Glu Gin Ser Lys Asn Trp Leu Pro 
465 - 470 475 480 

Gly Pro Cys Tyr Arg Gin Gin Arg Leu Ser Lys Asn He Asp Ser Asn 
485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn 
500 505 510 

Gly Arg Asn Ser Leu Thr Asn Pro Gly Val Ala Met Ala Thr Asn Lys 
515 520 525 

Asp Asp Glu Asp Gin Phe Phe Pro He Asn Gly Val Leu Val Phe Gly 
530 535 540 

Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met Thr 
545 550 555 560 

Ser Glu Glu Glu He Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 
565 570 575 

Gly Val Val Ser Ser Asn Leu Gin Ser Ser Thr Ala Gly Pro Gin Thr 
580 585 590 

Gin Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn 
595 600 605 

Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr 
610 615 620 
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Asp Gly Asn Phe His Pro Ser Pro Leu Met Asp Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He Thr Gin 
660 665 670, 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 

Ala Lys Ser Asn Asn Val Glu Phe Ala Val Asn Asn Glu Gly Val Tyr 
705 710 715 720 



Thr Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 113 
<211> 685 
<212> PRT 

<213> capsid protein of AAV serotype, clone 42.12 
<400> 113 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Lys Gin Leu Glu Gin Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 
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Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 



Gly Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro lie Gly Glu Pro 
180 185 190 



Pro Ala Gly Pro Ser Gly Leu Gly Ser Gly Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp 
260 265 270 



Asn Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn 
305 310 315 320 



lie Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr lie Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 " 360 365 
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Pro Ala Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 395 400 



Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr 
405 410 415 



Gin Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 430 



Leu Asp Arg Leu Thr Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435 440 445 



Ala Arg Thr Gin Ser Thr Thr Gly Ser Thr Arg Gly Leu Gin Phe His 
450 455 460 



Gin Ala Gly Pro Asn Thr Met Ala Glu Gin Ser Lys Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Leu Ser Lys Asn lie Asp Ser Asn 
485 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His Leu Asn 
500 505 510 



Gly Arg Asn Ser Leu Thr Asn Pro Gly Val Ala Met Ala Thr Asn Lys 
515 520 525 



Asp Asp Glu Asp Gin Phe Phe Pro lie Asn Gly Val Leu Val Phe Gly 

530 535 540 

Lys Thr Gly Ala Ala Asn Lys Thr Thr Leu Glu Asn Val Leu Met Thr 

545 550 555 560 



Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr Glu Glu Tyr 
565 570 575 



Gly Val Val Ser Ser Asn Leu Gin Ser Ser Thr Ala Gly Pro Gin Thr 
580 585 590 



Gin Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val Trp Gin Asn 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr 
610 ~ 615 620 
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Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 630 635 640 



His Pro Pro Pro Gin lie Leu lie Lys Tyr Thr Ser Asn Tyr Tyr Lys 
645 650 655 



Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Thr Tyr Ser Glu 
660 665 670 



Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
675 680 685 



<210> 114 
<211> 724 
<212> PRT 

<213> capsid protein of AAV serotype, clone AAV5CAP 
<400> 114 

Met Ser Phe Val Asp His Pro Pro Asp Trp Leu Glu Glu Val Gly Glu 
15 10 15 



Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 
20 25 30 



Pro Asn Gin Gin His Gin Asp Gin Ala Arg Gly Leu Val Leu Pro Gly 
35 40 45 



Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro Val 
50 55 60 



Asn Arg Ala Asp Glu Val Ala Arg Glu His Asp lie Ser Tyr Asn Glu 
65 70 75 80 



Gin Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 
85 90 95 



Ala Glu Phe Gin Glu Lys Leu Ala Asp Asp Thr Ser Phe Gly Gly Asn 
100 105 110 



Leu Gly Lys Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro Phe 
115 120 125 



Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg lie 
130 135 140 



Asp Asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu Asp Ser 
145 150 155 160 



Lys Pro Ser Thr Ser Ser Asp Ala Glu Ala Gly Pro Ser Gly Ser Gin 
165 170 175 



264 



WO 03/042397 



PCT/US02/33629 



Gin Leu Gin lie Pro Ala Gin Pro Ala Ser Ser Leu Gly Ala Asp Thr 
180 185 190 



Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gin Gly Ala 
195 200 205 



Asp Gly Val Gly Asn Ala Ser Gly Asp Trp His Cys Asp Ser Thr Trp 
210 215 220 



Met Gly Asp Arg Val Val Thr Lys Ser Thr Arg Thr Trp Val Leu Pro 
225 "* 230 235 240 



Ser Tyr Asn Asn His Gin Tyr Arg Glu lie Lys Ser Gly Ser Val Asp 
245 250 255 



Gly Ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 
260 265 270 



Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gin 
275 280 285 



Arg Leu lie Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg Val 
290 295 300 



Lys lie Phe Asn He Gin Val Lys Glu Val Thr Val Gin Asp Ser Thr 
305 310 315 320 



Thr Thr He Ala Asn Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp 
325 330 335 



Asp Asp Tyr Gin Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly Cys 
340 345 350 



Leu Pro Ala Phe Pro Pro Gin Val Phe Thr Leu Pro Gin Tyr Gly Tyr 
355 360 365 



Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 
370 375 380 



Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 
385 390 395 400 



Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His Ser Ser 
405 410 415 



Phe Ala Pro Ser Gin Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
420 425 430 
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Gin Tyr Leu Tyr Arg Phe Val Ser Thr Asn Asn Thr Gly Gly Val Gin 
435 440 445 



Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 
450 455 460 



Phe Pro Gly Pro Met Gly Arg Thr Gin Gly Trp Asn Leu Gly Ser Gly 
465 470 475 480 



Val Asn Arg Ala Ser Val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 
485 490 495 



Leu Glu Gly Ala Ser Tyr Gin Val Pro Pro Gin Pro Asn Gly Met Thr 
500 505 510 



Asn Asn Leu Gin Gly Ser Asn Thr Tyr Ala Leu Glu Asn' Thr Met lie 
515 520 525 



Phe Asn Ser Gin Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 
530 535 540 



Gly Asn Met Leu lie Thr Ser Glu Ser Glu Thr Gin Pro Val Asn Arg 
545 550 555 560 



Val Ala Tyr Asn Val Gly Gly Gin Met Ala Thr Asn Asn Gin Ser Ser 
565 570 575 



Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gin Glu lie Val Pro 
580 585 590 



Gly Ser Val Trp Met Glu Arg Asp Val Tyr Leu Gin Gly Pro lie Trp 
595 600 605 



Ala Lys lie Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 
610 615 620 



Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu lie Lys Asn 
625 S 630 635 640 



Thr Pro Val Pro Gly Asn lie Thr Ser Phe Ser Asp Val Pro Val Ser 
645 650 655 



Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Thr Val Glu Met Glu 
660 665 670 



Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin 
675 680 685 
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Tyr Thr Asn Asn Tyr Asn Asp Pro Gin Phe Val Asp Phe Ala Pro Asp 
690 695 700 

Ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro lie Gly Thr Arg Tyr Leu 
705 710 715 720 

Thr Arg Pro Leu 

<210> 115 
<211> 9 
<212> DNA 

<213> Drain restriction enzyme site 
<400> 115 

caccacgtc 9 

<210> 116 

<211> 28 

<212> DNA 

<213> AV2cas 

<400> 116 

cgcagagacc aaagttcaac tgaaacga 28 

<210> 117 
<211> 255 
<212> DNA 

<213> adeno-associated virus serotype 10 
<400> 117 

ggtaattcct ccggaaattg gcattgcgat tccacatggc tgggcgacag agtcatcacc 60 
accagcaccc gaacctgggt cctgcccacc tacaacaacc acatctacaa gcaaatctcc 120 
agcgagacag gagccaccaa cgacaaccac tacttcggct acagcacccc ctgggggtat 180 
tttgacttta acagattcca ctgccacttt tcaccacgtg actggcagcg actcatcaac 240 
aacaactggg gattc 255 



<210> 118 
<211> 258 
<212> DNA 

<213> adeno-associated virus serotype 11 
<400> 118 

ggtaattcct ccggaaattg gcattgcgat tccacatggc tgggcgacag agtcatcacc 60 
accagcaccc gaacctgggc cctgccaacc tacaacaacc acctctacaa acaaatctcc 120 
agcgcttcaa cgggggccag caacgacaac cactactttg gctacagcac cccctggggg 180 
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tattttgact ttaacagatt ccactgccac ttctcaccac gtgactggca gcgactcatc 



240 



aacaacaact ggggattc 



258 



<210> 119 
<211> 255 
<212> DNA 

<213> adeno-associated virus serotype 12 
<400> 119 

ggtaattcct ccggaaattg gcattgcgat tccacatggc tgggcgaccg agtcattacc 60 

accagcaccc ggacttgggc cctgcccacc tacaacaacc acctctacaa gcaaatctcc 120 

agccaatcgg gtgccaccaa cgacaaccac tacttcggct acagcacccc ttgggggtat 180 

tttgatttca acagattcca ctgccatttc tcaccacgtg actggcagcg actcatcaac 24 0 

aacaactggg gattc 255 

<210> 120 

<211> 2205 

<212> DNA 

<213> adeno-associated virus serotype, clone A3.1vpl 



<400> 120 
atggctgccg 


atggttatct 


tccagattgg 


ctcgaggaca 


ctctctctga 


aggaatcaga 


60 


cagtggtgga 


agctcaaacc 


tggcccacca 


ccgccgaaac 


ctaaccaaca 


acaccgggac 


120 


gacagtaggg 


gtcttgtgct 


tcctgggtac 


aagtacctcg 


gacccttcaa 


cggactcgac 


180 


aaaggagagc 


cggtcaacga 


ggcagacgcc 


gcggccctcg 


agcacgacaa 


agcctacgac 


240 


caccagctca 


agcaagggga 


caacccgtac 


ctcaaataca 


accacgcgga 


cgctgaattt 


300 


caggagcgtc 


ttcaagaaga 


tacgtctttc 


gggggcaacc 


tcgggcgagc 


agtcttccag 


3 60 


gccaaaaaga 


gggtactcga 


gcctcttggt 


ctggttgagg 


aagctgttaa 


gacggctcct 


420 


ggaaaaaaga 


gacctataga 


gcagtctcct 


gcagaaccgg 


actcttcctc 


gggcatcggc 


480 


aaatcaggcc 


agcagcccgc 


taagaaaaga 


ctcaattttg 


gtcagactgg 


cgacacagag 


540 


tcagtcccag 


accctcaacc 


aatcggagaa 


ccccccgcag 


ccccctctgg 


tgtgggatct 


600 


aatacaatgg 


cttcaggcgg 


tggggcacca 


atggcagaca 


ataacgaagg 


cgccgacgga 


660 


gtgggtaatt 


cctcgggaaa 


ttggcattgc 


gattccacat 


ggatgggcga 


cagagttatc 


720 


accaccagca 


caagaacctg 


ggccctcccc 


acctacaata 


atcacctcta 


caagcaaatc 


780 


tccagcgaat 


cgggagccac 


caacgacaac 


cactacttcg 


gctacagcac 


cccctggggg 


840 


tattttgact 


ttaacagatt 


ccactgtcac 


ttctcaccac 


gtgactggca 


gcgactcatc 


900 


aacaacaact 


ggggatttag 


acccaagaaa 


ctcaatttca 


agctcttcaa 


catccaagtc 


960 


aaggaggtca 


cgcagaatga 


tggaaccacg 


accatcgcca 


ataaccttac 


cagcacggtg 


1020 
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caggtct tea 


cagactctga 


gtaccagctg 


ccctacgtcc 


teggttegge 


tcaccagggc 


lUo 0 


t~/Tr*f*""t~+*r* r- rv c* 
LLJ LL L LLLLjL 


r* rr V" r*' r- r~ -zi rr r* 

eg ttcccagc 


agacg tc u uc 


^^~rr^+-\-r*r*+'r* 

aLgaLtcctc 


agtaeggcta cttgactctg 


1 1 a n 


ddCaaLyyCa 


r>r a rr r» rr rr\~ 

gecaagegg u 


s rr r*r ^ r*» 4" 4* 4* 

aggacg t tct 


^ r* Tk 4* +* /■>« 4~ ^ 4™ 

tCoLLCLaCu 


gtctagagta 


ttttccctct 


1 onn 


c a gat gc tga 


ggacgggaaa 


caact tcacc 


ttcagctaca 


cttttgaaga 


cgtgcctttc 


12 60 


cacagcagct 


acgcgcacag 


ccagagtctg 


gateggctga 


tgaatcctct 


cattgaccag 


1320 


+- /-» +* y-f 4~ -% +- 4- 

LaCCtgLatt 


acct gagcaa 


aac tcagggt 


acaagtggaa 


caacgcagca 


atcgagactg 


138 0 


Cay itcayCC 


aagcugggee 


t agct ccat g 


gc tcagcagg 


ccaaaaactg 


getaceggga 


1 a a n 




gaeagcageg 


aaugucuaag 


aegget aatg 


acaacaacaa 


cagtgaattt 


1 CAft 

ijUU 


rr 1~ t* rr rr r»i"rf 
y i- l. L.y y a l_ l, y 


LagLLaL L. del 


a m l. Lav Lg 


aatrrrraanaa 

aaLggaayaa 


attctctggt 


caatcccggg 


1DOU 




CCdy LCaCaa 


ggacga ugag 


gaaaagta u u 


tccccatgca 


eggaaatetc 


i c 0 n 


aLL-ULtyyaa 


~* — i « ^ *v rr r* a /—1 

aaCaaggCaC 


aggaacxacc 


aat gtggaca 


ttgaatcagt 


gcttattaca 


icon 
1 bo U 


yaCgdagaay 


aaa ucagaac 


aaClaaiCCL 


gtggctacag 


aacaataegg 


acaggttgee 


1 /4U 


accaaccatc 


agagtcagaa 


caccacagct 


tcctatggaa 


gtgtggacag 


ccagggaatc 


1800 


LLaccty gaa 


ugg ug t ggca 


ggaccgegat 


gtct atcttc 


aaggtcccat 


ttgggccaaa 




ac t cc tcaca 


eggaeggaca 


ctttcatcct 


tc tccgctca 


tgggaggctt 


tggactgaaa 


192 0 


cacccLCCuc 


cccagatcct 


gatcaaaaac 


acacctgtgc 


cagcgaatcc 


cgcgaccact 


1980 


ttcactcctg 


gaaagtttgc 


ttcgttcatt 


acccagtatt 


ccaccggaca 


ggtcagcgtg 


2040 


gaaatagagt 


gggagctgea 


gaaagaaaac 


ageaaacget 


ggaacccaga 


aattcagtac 


2100 


acctccaact 


acaacaagtc 


ggtgaatgtg 


gagtttaccg 


tggacgcaaa 


cggtgtttat 


2160 


tctgaacccc 


gecctattgg 


cactcgttac 


cttacccgga 


acttg 




2205 
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(54) Title: A METHOD OF DETECTING AND/OR IDENTIFYING ADENO-ASSOCIATED VIRUS (AAV) SEQUENCES AND 
O ISOLATING NOVEL SEQUENCES IDENTIFIED THEREBY 
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^ (57) Abstract: A method for detecting and isolating AAV sequences in a sample of DNA obtained from tissue or cells is provided. 
^ The invention further provides AAV sequences identified by this method, and vectors constructed using these sequences. 
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A METHOD OF DETECTING AND/OR IDENTIFYING ADENO-ASSOCIATED 
VIRUS (AAV) SEQUENCES AND ISOLATING NOVEL SEQUENCES IDENTIFIED 
THEREBY 

5 BACKGROUND OF THE INVENTION 

Adeno-associated virus (AAV), a member of the Parvovirus family, is a small 
nonenveloped, icosahedral virus with single-stranded linear DNA genomes of 4.7 kilobases 
(kb) to 6 kb. AAV is assigned to the genus, Dependovirus, because the virus was discovered 
as a contaminant in purified adenovirus stocks, AAV's life cycle includes a latent phase at 

10 which AAV genomes, after infection, are site specifically integrated into host chromosomes 
and an infectious phase in which, following either adenovirus or herpes simplex virus 
infection, the integrated genomes are subsequently rescued, replicated, and packaged into 
infectious viruses. The properties of non-pathogenicity, broad host range of infectivity, 
including non-dividing cells, and potential site-specific chromosomal integration make AAV 

1 5 an attractive tool for gene transfer. 

Recent studies suggest that AAV vectors may be the preferred vehicle for gene 
therapy. To date, there have been 6 different serotypes of AAVs isolated from human or 
non-human primates (NHP) and well characterized. Among them, human serotype 2 is the 
first AAV that was developed as a gene transfer vector; it has been widely used for efficient 

20 gene transfer experiments in different target tissues and animal models. Clinical trials of the 
experimental application of AAV2 based vectors to some human disease models are in 
progress, and include such diseases as cystic fibrosis and hemophilia B. 
What are desirable are AAV-based constructs for gene delivery. 

25 SUMMARY OF THE INVENTION 

In one aspect, the invention provides a novel method of detecting and identifying 
AAV sequences from cellular DNAs of various human and non-human primate (NHP) 
tissues using bioinformatics analysis, PCR based gene amplification and cloning technology, 
based on the nature of latency and integration of AAVs in the absence of helper virus co- 

30 infection. 

In another aspect, the invention provides method bf isolating novel AAV sequences 
detected using the above described method of the invention. The invention further comprises 
methods of generating vectors based upon these novel AAV serotypes, for serology and gene 
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transfer studies solely based on availability of capsid gene sequences and structure of rep/cap 
gene junctions. 

In still another aspect, the invention provides a novel method for performing studies 
of serology, epidemiology, biodistribution and mode of transmission, using reagents 
5 according to the invention, which include generic sets of primers/probes and quantitative real 
time PCR. 

In yet another aspect, the invention provides a method of isolating complete and 
infectious genomes of novel AAV serotypes from cellular DN A of different origins using 
RACE and other molecular techniques. 
10 In a further aspect, the invention provides a method of rescuing novel serotypes of 

AAV genomes from human and NHP cell lines using adenovirus helpers of different origins. 

In still a further aspect, the invention provides novel AAV serotypes, vectors 
containing same, and methods of using same. 

These and other aspects of the invention will be readily apparent from the following 
15 detailed description of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1 A through 1 AAAR provide an alignment of the nucleic acid sequences 
encoding at least the cap proteins for the AAV serotypes. The full-length sequences 

20 including the ITRs, the rep region, and the capsid region are provided for novel AAV 
serotype 7 [SEQ ID NO:l], and for previously published AAV1 [SEQ IN NO:6], AAV2 
[SEQ ID NO:7]; and AAV3 [SEQ ID NO:8]. Novel AAV serotypes AAV8 [SEQ ID NO:4] 
and AAV9 [SEQ ID NO:5] are the subject of co-filed applications. The other novel clones 
of the invention provided in this alignment include: 42-2 [SEQ ID NO:9], 42-8 [SEQ ID 

25 NO:27], 42-15 [SEQ ID NO:28], 42-5b [SEQ ID NO: 29], 42- lb [SEQ ID NO:30]; 42-13 

[SEQ ID NO: 3 1], 42-3a [SEQ ID NO: 32], 42-4 [SEQ ID NO:33], 42-5a [SEQ ID NO: 34], 
42-10 [SEQ ID NO:35], 42-3b [SEQ ID NO: 36], 42-1 1 [SEQ ID NO: 37], 42-6b [SEQ ID 
NO:38], 43-1 [SEQ ID NO: 39], 43-5 [SEQ ID NO: 40], 43-12 [SEQ ID NO:41], 43-20 
[SEQ ID NO:42], 43-21 [SEQ ID NO: 43], 43-23 [SEQ ID NO:44], 43-25 [SEQ ID NO: 45], 

30 44.1 [SEQ ID NO:47], 44.5 [SEQ ID NO:47], 223.10 [SEQ ID NO:48], 223.2 [SEQ ID 
NO:49], 223.4 [SEQ ID NO:50], 223.5 [SEQ ID NO: 51], 223.6 [SEQ ID NO: 52], 223.7 
[SEQ ID NO: 53], A3.4 [SEQ ID NO: 54], A3.5 [SEQ ID NO:55], A3 .7 [SEQ ID NO: 56], 
A3.3 [SEQ ID NO:57], 42.12 [SEQ ID NO: 58], 44.2 [SEQ ID NO: 59]. The nucleotide 
sequences of the signature regions of AAV10 [SEQ ID NO: 117], AAV 1 1 [SEQ ID NO: 
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118] and AAV 12 [SEQ ID NO:l 19] are provided in this figure. Critical landmarks in the 
structures of AAV genomes are shown. Gaps are demonstrated by dots. The 3' ITR of AAV 1 
[SEQ ID NO:6] is shown in the same configuration as in the published sequences. TRS 
represents terminal resolution site. Notice that AAV7 is the only AAV reported that uses 
5 GTG as the initiation codon for VP3. 

Figs. 2A through 2F are an alignment of the amino acid sequences of the 
proteins of the vpl capsid proteins of previously published AAV serotypes 1 [SEQ ID 
NO:64], AAV2 [SEQ ID NO:70] 5 AAV3 [SEQ ID NO: 71], AAV4 [SEQ ID NO:63], AAV5 
[SEQ ID NO:l 14], and AAV6 [SEQ ID NO:65] and novel AAV sequences of the invention, 
10 including: CI [SEQ IDNO:60], C2 [SEQ IDNO:61], C5 [SEQ IDNO:62], A3-3 [SEQ ID 
NO:66], A3-7 [SEQ ID NO:67], A3-4 [SEQ ID NO:68], A3-5 [SEQ ID NO: 69], 3.3b [SEQ 
ID NO: 62], 223.4 [SEQ ID NO: 73], 223-5 [SEQ IDNO:74], 223-10 [SEQ ID NO:75], 223- 
2 [SEQ ID NO:76], 223-7 [SEQ ID NO: 77], 223-6 [SEQ ID NO: 78], 44-1 [SEQ ID NO: 
79], 44-5 [SEQ ID NO:80], 44-2 [SEQ ID NO:81], 42-15 [SEQ ID NO: 84], 42-8 [SEQ ID 
15 NO: 85], 42-13 [SEQ ID NO:86], 42-3A [SEQ ID NO:87], 42-4 [SEQ ID NO:88], 42-5A 
[SEQ ID NO:89], 42-1 B [SEQ ID NO:90], 42-5B [SEQ ID NO:91], 43-1 [SEQ ID NO: 92], 
43-12 [SEQ ID NO: 93], 43-5 [SEQ ID NO:94], 43-21 [SEQ ID NO:96], 43-25 [SEQ ID 
NO: 97], 43-20 [SEQ ID NO:99], 24.1 [SEQ ID NO: 101], 42.2 [SEQ ID NO:102], 7.2 [SEQ 
ID NO: 103], 27.3 [SEQ ID NO: 104], 16.3 [SEQ ID NO: 105], 42.10 [SEQ ID NO: 106], 
20 42-3B[SEQIDNO: 107], 42-11 [SEQ ID NO: 108], Fl [SEQ ID NO: 109], F5 [SEQ ID 
NO: 1 1 0], F3 [SEQ ID NO:l 1 1], 42-6B [SEQ ID NO: 112], 42-12 [SEQ ID NO: 1 1 3], 
Novel serotypes AAV8 [SEQ ID NO:95] and AAV9 [SEQ ID NO:100] are the subject of co- 
filed patent applications. 

Figs. 3 A through 3C provide the amino acid sequences of the AAV7 rep 
25 proteins [SEQ ID NO:3]. 

DETAILED DESCRIPTION OF THE INVENTION 

In the present invention, the inventors have found a method which takes advantage of 
the ability of adeno-associated virus (AAV) to penetrate the nucleus, and, in the absence of a 
30 helper virus co-infection, to integrate into cellular DNA and establish a latent infection. This 
method utilizes a polymerase chain reaction (PCR)-based strategy for detection, 
identification and/or isolation of sequences of AAVs from DNAs from tissues of human and 
non-human primate origin as well as from other sources. Advantageously, this method is 
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also suitable for detection, identification and/or isolation of other integrated viral and non- 
viral sequences, as described below. 

The invention further provides nucleic acid sequences identified accord ing'to the 
methods of the invention. One such adeno-associated* virus is of a novel serotype, termed 
5 herein serotype 7 (AAV7). Other novel adeno-associated virus serotypes provided herein 
include AAV10, AAV11 , and AAV12. Still other novel AAV serotypes identified according 
to the methods of the invention are provided in the present specification. See, Figures and 
Sequence Listing, which is incorporated by reference. 

Also provided are fragments of these AAV sequences. Among particularly desirable 
10 AAV fragments are the cap proteins, including the vpl, vp2, vp3, the hypervariable regions, 
v the rep proteins, including rep 78, rep 68, rep 52, and rep 40, and the sequences encoding 
these proteins. Each of these fragments may be readily utilized in a variety of vector systems 
and host cells. Such fragments may be used alone, in combination with other AAV 
sequences or fragments, or in combination with elements from other AAV or non-AAV viral 
] 5 sequences. In one particularly desirable embodiment, a vector contains the AAV cap and/or 
rep sequences of the invention. 

As described herein, alignments are performed using any of a variety of publicly or 
commercially available Multiple Sequence Alignment Programs, such as "Clustal W", 
accessible through Web Servers on the internet. Alternatively, Vector NTI utilities are also 
20 used. There are also a number of algorithms known in the art which can be used to measure 
nucleotide sequence identity, including those contained in the programs described above. As 
another example, polynucleotide sequences can be compared using Fasta, a program in GCG 
Version 6.1 . Fasta provides alignments and percent sequence identity of the regions of the 
best overlap between the query and search sequences. For instance, percent sequence 
25 identity between nucleic acid sequences can be determined using Fasta with its default 

parameters (a word size of 6 and the NOPAM factor for the scoring matrix) as provided in 
GCG Version 6.1, herein incorporated by reference. Similar programs are available for 
amino acid sequences, e.g., the C£ Clustal X" program. Generally, any of these programs are 
used at default settings, although one of skill in the art can alter these settings as needed. 
30 Alternatively, one of skill in the art can utilize another algorithm or computer program which 
provides at least the level of identity or alignment as that provided by the referenced 
algorithms and programs. 

The term "substantial homology" or "substantial similarity," when referring to a 
nucleic acid, or fragment thereof, indicates that, when optimally aligned with appropriate 
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nucleotide insertions or deletions with another nucleic acid (or its complementary strand), 
there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences. 
Preferably, the homology is over full-length sequence, or an open reading frame thereof, or 
another suitable fragment which is at least 15 nucleotides in length. Examples of suitable 

5 fragments are described herein. 

The term "substantial homology" or "substantial similarity," when referring to amino 
acids or fragments thereof, indicates that, when optimally aligned with appropriate amino 
acid insertions or deletions with another amino acid, there is amino acid sequence identity in 
at least about 95 to 99% of the aligned sequences. Preferably, the homology is over full- 

10 length sequence, or a protein thereof, e.g., a cap protein, a rep protein, or a fragment thereof 
which is at least 8 amino acids, or more desirably, at least 15 amino acids in length. 
Examples of suitable fragments are described herein. 

By the term "highly conserved" is meant at least 80% identity, preferably at least 
90% identity, and more preferably, over 97% identity. Identity is readily determined by one 

15 of skill in the art by resort to algorithms and computer programs known by those of skill in 
the art. 

The term "percent sequence identity" or "identical" in the context of nucleic acid 
sequences refers to the residues in the two sequences which are the same when aligned 
for maximum correspondence. The length of sequence identity comparison may be over 

20 the full-length of the genome, the full-length of a gene coding sequence, or a fragment of 
at least about 500 to 5000 nucleotides, is desired. However, identity among smaller 
fragments, e.g. of at least about nine nucleotides, usually at least about 20 to 24 
nucleotides, at least about 28 to 32 nucleotides, at least about 36 or more nucleotides, 
may also be desired. Similarly, "percent sequence identity" may be readily determined 

25 for amino acid sequences, over the full-length of a protein, or a fragment thereof. 

Suitably, a fragment is at least about 8 amino acids in length, and may be up to about 700 
amino acids. Examples of suitable fragments are described herein. 

The AAV sequences and fragments thereof are useful in production of rAAV, and 
are also useful as antisense delivery vectors, gene therapy vectors, or vaccine vectors. The 

30 invention further provides nucleic acid molecules, gene delivery vectors, and host cells 
which contain the AAV sequences of the invention. 
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As described herein, the vectors of the invention containing the AAV capsid proteins ~" 
of the invention are particularly well suited for use in applications in which the neutralizing 
antibodies diminish the effectiveness of other AAV serotype based vectors, as well as other 

5 viral vectors. The rA AV vectors of the invention are particularly advantageous in rA AV 
readministration and repeat gene therapy. 

These and other embodiments and advantages of the invention are described in more 
detail below. As used throughout this specification and the claims, the terms "comprising" 
and "including" and their variants are inclusive of other components, elements, integers, 

10 steps and the like. Conversely, the term "consisting" and its variants is exclusive of other 
components, elements, integers, steps and the like. 

I. Methods of the Invention 

A. Detection of Sequences Via Molecular Cloning 

15 In one aspect, the invention provides a method of detecting and/or identifying 

target nucleic acid sequences in a sample. This method is particularly well suited for 
detection of viral sequences which are integrated into the chromosome of a cell, e.g., adeno- 
associated viruses (AAV) and retroviruses, among others. The specification makes 
reference to AAV, which is exemplified herein. However, based on this information, one of 

20 skill in the art may readily perform the methods of the invention on retroviruses [e.g., feline 
leukemia virus (FeLV), HTLVI and HTLVII], and lentivirinae [e.g., human 
immunodeficiency virus (HIV), simian immunodeficiency virus (SIV), feline 
immunodeficiency virus (FIV), equine infectious anemia virus, and spumavirinal)], among 
others: Further, the method of the invention may also be used for detection of other viral 

25 and non-viral sequences, whether integrated or non-integrated into the genome of the host 
cell. 

As used herein, a sample is any source containing nucleic acids, e.g., tissue, 
tissue culture, cells, cell culture, and biological fluids including, without limitation, urine and 
blood. These nucleic acid sequences may be DNA or RNA from plasmids, natural DNA or 
30 RNA from any source, including bacteria, yeast, viruses, and higher organisms such as plants 
or animals. DNA or RNA is extracted from the sample by a variety of techniques known to 
those of skill in the ait, such as those described by Sambrook, Molecular Cloning: A 
Laboratory Manual (New York: Cold Spring Harbor Laboratory). The origin of the sample 
and the method by which the nucleic acids are obtained for application of the method of the 
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invention is not a limitation of the present invention. Optionally, the method of the invention 
can be performed directly on the source of DNA, or on nucleic acids obtained (e.g., 
extracted) from a source. 

The method of the invention involves subjecting a sample containing DNA to 
5 amplification via polymerase chain reaction (PCR) using a first set of primers specific for a 
first region of double-stranded nucleic acid sequences, thereby obtaining amplified 
sequences. 

As used herein, each of the "regions" is predetermined based upon the 
alignment of the nucleic acid sequences of at least two serotypes (e.g., AAV) or strains (e.g., 

10 lentiviruses), and wherein each of said regions is composed of sequences having a 5' end 
which is highly conserved, a middle which is preferably, but necessarily, variable, and a 3 r 
end which is highly conserved, each of these being conserved or variable relative to the 
sequences of the at least two aligned AAV serotypes. Preferably, the 5' and/or 3* end is 
highly conserved over at least about 9, and more preferably, at least 1 8 base pairs (bp). 

15 However, one or both of the sequences at the 5' or 3' end may be conserved over more than 
18 bp, more than 25 bp, more than 30 bp, or more than 50 bp at the 5' end. With respect to 
the variable region, there is no requirement for conserved sequences, these sequences may be 
relatively conserved, or may have less than 90, 80, or 70% identity among the aligned 
serotypes or strains. 

20 Each of the regions may span about 100 bp to about 10 Icilobase pairs in 

length. However, it is particularly desirable that one of the regions is a "signature region", 
i.e., a region which is sufficiently unique to positively identify the amplified sequence as 
being from the target source. For example, in one embodiment, the first region is about 250 
bp in length, and is sufficiently unique among known AAV sequences, that it positively 
25 identifies the amplified region as being of AAV origin. Further, the variable sequences 

within this region are sufficiently unique that can be used to identify the serotype from which 
the amplified sequences originate. Once amplified (and thereby detected), the sequences can 
be identified by performing conventional restriction digestion and comparison to restriction 
digestion patterns for this region in any of AAV I, AAV2, AAV3, AAV4, AAV5, or AAV6, 
30 or that of AAV7, AAV 10, AAV1 1, AAV12, or any of the other novel serotypes identified by 
the invention, which is predetermined and provided by the present invention. 

Given the guidance provided herein, one of skill in the art can readily 
identify such regions among other integrated viruses to permit ready detection and 
identification of these sequences. Thereafter, an optimal set of generic primers located 
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within the highly conserved ends can be designed and tested for efficient amplification of the 
selected region from samples. This aspect of the invention is readily adapted to a diagnostic 
kit for detecting the presence of the target sequence (e.g., AAV) and for identifying the AAV 
serotype, using standards which include the restriction patterns for the AAV serotypes 
described herein or isolated using the techniques described herein. For example, quick 
identification or molecular serotyping of PCR products can be accomplished by digesting the 
PGR products and comparing restriction patterns. 

Thus, in one embodiment, the "signature region" for AAV spans about bp 
2800 to about 3200 of AAV 1 [SEQ ID NO:6], and corresponding base pairs in AAV 2, 
AAV3, AAV4, AAV5, and AAV6. More desirably, the region is about 250 bp, located 
within bp 2886 to about 3143 bp of AAV 1 [SEQ IDNO:6], and corresponding base pairs in 
AAV 2 [SEQ ID NO:7], AAV3 [SEQ ID N08], and other AAV serotypes. See, Fig. 1. To 
permit rapid detection of AAV in the sample, primers which specifically amplify this 
signature region are utilized. However, the present invention is not limited to the exact 
sequences identified herein for the AAV signature region, as one of skill in the art may 
readily alter this region to encompass a shorter fragment, Or a larger fragment of this 
signature region. 

The PCR primers are generated using techniques known to those of skill in 
the art. Each of the PCR primer sets is composed of a 5' primer and a 3' primer. See, e.g., 
Sambrook et al, cited herein. The term "primer" refers to an oligonucleotide which acts as a 
point of initiation of synthesis when placed under conditions in which synthesis of a primer 
extension product which is complementary to a nucleic acid strand is induced. The primer is 
preferably single stranded. However, if a double stranded primer is utilized, it is treated to 
separate its strands before being used to prepare extension products. The primers may be 
about 15 to 25 or more nucleotides, and preferably at least 18 nucleotides. However, for 
certain applications shorter nucleotides, e.g., 7 to 15 nucleotides are utilized. 

The primers are selected to be sufficiently complementary to the different 
strands of each specific sequence to be amplified to hybridize with their respective strands. 
Therefore, the primer sequence need not reflect the exact sequence of the region being 
amplified. For example, a non-complementary nucleotide fragment may be attached to the 
5' end of the primer, with the remainder of the primer sequence being completely 
complementary to the strand. Alternatively, non-complementary bases or longer sequences 
can be interspersed into the primer, provided that the primer sequence has sufficient 
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complementarity with the sequence of the strand to be amplified to hybridize therewith and 
form a template for synthesis of the extension product of the other primer. 

The PCR primers for the signature region according to the invention are 
based upon the highly conserved sequences of two or more aligned sequences (e.g., two or 

5 more AAV serotypes). The primers can accommodate less than exact identity among the 
two or more aligned AAV serotypes at the 5' end or in the middle. However, the sequences 
at the 3 5 end of the primers correspond to a region of two or more aligned AAV serotypes in 
which there is exact identity over at least five, preferably, over at least nine base pairs, and 
more preferably, over at least 18 base pairs at the 3 5 end of the primers. Thus, the V end of 

10 the primers is composed of sequences with 100% identity to the aligned sequences over at 
least five nucleotides. However, one can optionally utilize one, two, or more degenerate 
nucleotides at the 3' end of the primer. 

For example, the primer set for the signature region of AAV was designed 
based upon a unique region within the AAV capsid, as follows. The 5' primer was based 

15 upon nt 2867-2891 of AAV2 [SEQ IDNO:7], 5'- 

GGTAATTCCTCCGGA AATTGGCATT3 5 . See, Fig. 1. The 3 5 primer was designed 
based upon nt 3096-3 122 of AAV2 [SEQ ID NO:7], 5'- 

GACTCATCAACAACAACTGGGGATTC-3\ However, one of skill in the art may have 
readily designed the primer set based upon the corresponding regions of AAV 1, AAV3, 

20 AAV4, AAV5, AAV6, or based upon the information provided herein, AAV7, AAV10, 

AAV1 1, AAV12, or another novel AAV of the invention. In addition, still other primer sets 
can be readily designed to amplify this signature region, using techniques known to those of 
skill in the art. 

B . Isolation of Target Sequences 

25 As described herein, the present invention provides a first primer set which 

specifically amplifies the signature region of the target sequence, e.g., an AAV serotype, in 
order to permit detection of the target. In a situation in which further sequences are desired, 
e.g., if a novel AAV serotype is identified, the signature region may be extended. Thus, the 
invention may further utilize one or more additional primer sets. 

30 Suitably, these primer sets are designed to include either the 5' or 3' primer 

of the first primer set and a second primer unique to the primer set, such that the primer set 
amplifies a region 5' or 3 5 to the signature region which anneals to either the 5' end or the 3' 
end of the signature region. For example, a first primer set is composed of a 5' primer, PI 
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and a 3' primer P2 to amplify the signature region. In order to extend the signature region on 
its 3' end, a second primer set is composed of primer PI and a 3' primer P4, which amplifies 
the signature region and contiguous sequences downstream of the signature region. In order 
to extend the signature 'region on its 5' end, a third primer set is composed of a 5' primer, P5, 
5 and primer P2, such that the signature region and contiguous sequences upstream of the 

signature region are amplified. These extension steps are repeated (or performed at the same 
time), as needed or desired. Thereafter, the products results from these amplification steps 
are fused using conventional steps to produce an isolated sequence of the desired length. 

The second and third primer sets are designed, as with the primer set for the 
10 signature region, to amplify a region having highly conserved sequences among the aligned 
sequences. Reference herein to the term "second" or "third" primer set is for each of 
discussion only, and without regard to the order in which these primers are added to the 
reaction mixture, or used for amplification. The region amplified by the second primer set is 
selected so that upon amplification it anneals at its 5' end to the 3' end of the signature 
1 5 region. Similarly, the region amplified by the third primer set is selected so that upon 

amplification it anneals at its 3' end anneals to the 5' end of the signature region. Additional 
primer sets can be designed such that the regions which they amplify anneal to the either the 
5' end or the 3' end of the extension products formed by the second or third primer sets, or 
by subsequent primer sets. 
20 For example, where AAV is the target sequence, a first set of primers (P 1 and 

P2) are used to amplify the signature region from the sample. In one desirable embodiment, 
this signature region is located within the AAV capsid. A second set of primers (PI and P4) 
is used to extend the 3' end of the signature region to a location in the AAV sequence which 
is just before the AAV 3' ITR, i.e., providing an extension product containing the entire 3 ! 
25 end of the AAV capsid when using the signature region as an anchor. In one embodiment, 
the P4 primer corresponds to nt 4435 to 4462 of AAV2 [SEQ ID NO:7], and corresponding 
sequences in the other AAV serotypes. This results in amplification of a region of about 1 .6 
lcb, which contains the 0.25 kb signature region. A third set of primers (P3 and P2) is used to 
extend the 5' end of signature region to a location in the AAV sequences which is in the 3' 
30 end of the rep genes, i.e., providing an extension product containing the entire 5' end of the 
AAV capsid when using the signature region as an anchor. In one embodiment, the P3 
primer corresponds to nt 1384 to 1409 of AAV2 [SEQ ID NO:7], and corresponding 
sequences in the other AAV serotypes. This results in amplification of a region of about 1 .7 
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kb, which contains the 0.25 kb signature region. Optionally, a fourth set of primers are used 
to further extend the extension product containing the entire 5' end of the AAV capsid to 
also include the rep sequences. In one embodiment, the primer designated P5 corresponds 
to nt 103 to 133 of AAV2 [SEQ ID NO:7), and corresponding sequences in the other AAV 

5 serotypes and is used in conjunction with the P2 primer. 

Following completion of the desired number of extension steps, the various 
extension products are fused, making use of the signature region as an anchor or marker, to 
construct an intact sequence. In the example provided herein, AAV sequences containing, at 
a minimum, an intact AAV cap gene are obtained. Larger sequences may be obtained, 

10 depending upon the number of extension steps performed. 

Suitably, the extension products are assembled into an intact AAV sequence 
using methods known to those of skill in the art. For example, the extension products may 
be digested with Dralll, which cleaves at the Dralll site located within the signature region, 
to provide restriction fragments which are re-ligated to provide products containing (at a 

15 minimum) an intact AAV cap gene. However, other suitable techniques for assembling the 
extension products into an intact sequence may be utilized. See, generally, Sambrook et aL 
cited herein. 

As an alternative to the multiple extension steps described above, another 
embodiment of the invention provides for direct amplification of a 3.1 kb fragment which 

20 allows isolation of full-length cap sequences. To directly amplify a 3.1 kb full-length cap 
fragment from NHP tissue and blood DNAs, two other highly conserved regions were 
identified in AAV genomes for use in PCR amplification of large fragments. A primer within 
a conserved region located in the middle of the rep gene is utilized (AVlns: 5' 
GCTGCGTCAACTGGACCAATGAGAAC 3', nt of SEQ ID NO:6) in combination with 

25 the 3' primer located in another conserved region downstream of the Cap gene (A V2cas: 5' 
CGCAGAGACCAAAGTTCAACTGAAACGA 3', SEQ ID NO: 7) for amplification of 
AAV sequences including the full-length AAV cap. Typically, following amplification, the 
products are cloned and sequence analysis is performed with an accuracy of > 99.9%. Using 
this method, the inventors have isolated at least 50 capsid clones which have subsequently 

30 been characterized. Among them, 37 clones were derived from Rhesus macaque tissues 

(rh.l - rh.37), 6 clones from cynomologous macaques (cy.l - cy.6), 2 clones from Baboons 
(bb.l and bb.2) and 5 clones from Chimps (ch.l -ch.5). These clones are identified 
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elsewhere in the specification, together with the species of animal from which they were 
identified and the tissues in that animal these novel sequences have been located. 
C. Alternative method for isolating novel AAV 

In another aspect, the invention provides an alternative method for isolating 
5 novel AAV from a cell. This method involves infecting the cell with a vector which 
provides helper functions to the AAV; isolating infectious clones containing AAV; 
sequencing the isolated AAV; and comparing the sequences of the isolated AAV to known 
AAV serotypes, whereby differences in the sequences of the isolated AAV and known AAV 
serotypes indicates the presence of a novel AAV. 
10 In one embodiment, the vector providing helper functions provides essential 

adenovirus functions, including, e.g., Ela, Elb, E2a, E40RF6. In one embodiment, the 
helper functions are provided by an adenovirus. The adenovirus may be a wild-type 
adenovirus, and may be of human or non-human origin, preferably non-human primate 
(NHP) origin. The DNA sequences of a number of adenovirus types are available from 
15 Genbank, including type Ad5 [Genbank Accession No. M73260]. The adenovirus sequences 
may be obtained from any known adenovirus serotype, such as serotypes 2, 3, 4, 7, 12 and 
40, and further including any of the presently identified human types [see, e.g., Horwitz, 
cited above]. Similarly adenoviruses known to infect non-human animals (e.g., 
chimpanzees) may also be employed in the vector constructs of this invention. See, e.g., US 
20 Patent No. 6,083,716. In addition to wild-type adenoviruses, recombinant viruses or non- 
viral vectors (e.g., piasmids, episomes, etc.) carrying the necessary helper functions may be 
utilized. Such recombinant viruses are known in the art and may be prepared according to 
published techniques. See, e.g., US Patent No. 5,871,982 and US Patent 6,251,677, which 
describe a hybrid Ad/AAV virus. The selection of the adenovirus type is not anticipated to 
25 limit the following invention. A variety of adenovirus strains are available from the 
American Type Culture Collection, Manassas, Virginia, or available by request from a 
variety of commercial and institutional sources. Further, the sequences of many such strains 
are available from a variety of databases including, e.g., PubMed and GenBank. 

In another, alternative, infectious AAV may be isolated using genome 
30 walking technology (Siebertef al, 1995, Nucleic Acid Research, 23:1087-1088, Friezner- 
Degen et ai, 1986, J. Biol Chem. 261:6972-6985, BD Biosciences Clontech, Palo Alto, 
CA). Genome walking is particularly well suited for identifying and isolating the sequences 
adjacent to the novel sequences identified according to the method of the invention. For 
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example, this technique may be useful for isolating inverted terminal repeat (ITRs) of the 
novel AAV serotype, based upon the novel AAV capsid and/or rep sequences identified 
using the methods of the invention. This technique is also useful for isolating sequences 
adjacent to other AAV and non-AAV sequences identified and isolated according to the 
5 present invention. See, Examples 3 and 4. 

The methods of the invention may be readily used for a variety of 
epidemiology studies, studies of biodistribution, monitoring of gene therapy via AAV 
vectors and vector derived from other integrated viruses. Thus, the methods are well suited 
for use in pre-packaged kits for use by clinicians, researchers, and epidemiologists. 

10 

II. Diagnostic Kit 

In another aspect, the invention provides a diagnostic kit for detecting the presence 
of a known or unknown adeno-associated virus (AAV) in a sample. Such a kit may contain 
a first set of 5' and 3' PCR primers specific for a signature region of the AAV nucleic acid 

15 sequence. Alternatively, or additionally, such a kit can contain a first set of 5' and 3' PCR 
primers specific for the 3.1 kb fragment which includes the full-length AAV capsid nucleic 
acid sequence identified herein (e.g., the AVlns and AV2cas primers.) Optionally, a kit of 
the invention may further contain two or more additional sets of 5' and 3' primers, as 
described herein, and/or PCR probes. These primers and probes are used according to the 

20 present invention amplify signature regions of each AAV serotype, e.g., using quantitative 
PCR. 

The invention further provides a kit useful for identifying an AAV serotype detected 
according to the method of the invention and/or for distinguishing novel AAV from known 
AAV. Such a kit may further include one or more restriction enzymes, standards for AAV 
25 serotypes providing their "signature restriction enzyme digestions analyses", and/or other 
means for determining the serotype of the AAV detected. 

In addition, kits of the invention may include, instructions, a negative and/or positive 
control, containers, diluents and buffers for the sample, indicator charts for signature 
comparisons, disposable gloves, decontamination instructions, applicator sticks or 
30 containers, and sample preparator cups, as well as any desired reagents, including media, 
wash reagents and concentration reagents. Such reagents may be readily selected from 
among the reagents described herein, and from among conventional concentration reagents. 
In one desirable embodiment, the wash reagent is an isotonic saline solution which has been 
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buffered to physiologic pH, such as phosphate buffered saline (PBS); the elution reagent is 
PBS containing 0.4 MNaCl, and the concentration reagents and devices. For example, one 
of skill in the art will recognize that reagents such as polyethylene glycol (PEG), or NH4SO4 
may be useful, or that devices such as filter devices. For example, a filter device with a 100 

5 K membrane would concentrate rAAV. 

The kits provided by the present invention are useful for performing the methods 
described herein, and for study of biodistribution, epidemiology, mode of transmission of 
novel AAV serotypes in human and NHPs. 

Thus, the methods and kits of the invention permit detection, identification, and 

10 isolation of target viral sequences, particularly integrated viral sequences. The methods and 
kits are particularly well suited for use in detection, identification and isolation of AAV 
sequences, which may include novel AAV serotypes. 

In one notable example, the method of the invention facilitated analysis of cloned 
AAV sequences by the inventors, which revealed heterogeneity of proviral sequences 

15 between cloned fragments from different animals, all of which were distinct from the known 
six AAV serotypes, with the majority of the variation localized to hypervariable regions of 
the capsid protein. Surprising divergence of AAV sequences was noted in clones isolated 
from single tissue sources, such as lymph node, from an individual rhesus monkey. This 
heterogeneity is best explained by apparent evolution of AAV sequence within individual 

20 animals due, in part, to extensive homologous recombination between a limited number of 
co-infecting parenteral viruses. These studies suggest sequence evolution of widely 
disseminated virus during the course of a natural AAV infection that presumably leads to the 
formation of swarms of quasispecies which differ from one another in the array of capsid 
hypervariable regions. This is the first example of rapid molecular evolution of a DNA virus 

25 in a way that formerly was thought to be restricted to RNA viruses. 

Sequences of several novel AAV serotypes identified by the method of the invention 
and characterization of these serotypes is provided. 

III. Novel AAV Serotypes 
30 A. Nucleic Acid Sequences 

Nucleic acid sequences of novel AAV serotypes identified by the methods of 
the invention are provided. See, SEQ ID NO:l, 9 - 59, and 1 17 - 120, which are 
incorporated by reference herein. See also, Fig. 1 and the sequence listing. 



14 



WO 03/042397 



PCT/US02/33629 



For novel serotype AAV7 5 the full-length sequences, including the AAV 5' 
ITRs, capsid, rep, and AAV 3 5 ITRs are provided in SEQ ID NO: I. 

For other novel AAV serotypes of the invention, the approximately 3.1 kb 
fragment isolated according to the method of the invention is provided. This fragment 

5 contains sequences encoding full-length capsid protein and all or part of the sequences 
encoding the rep protein. These sequences include the clones identified below. 

For still other novel AAV serotypes, the signature region encoding the capsid 
protein is provided. For example, the AAV1 0 nucleic acid sequences of the invention 
include those illustrated in Fig. 1 [See, SEQ ID NOT 17, which spans 255 bases]. The 

10 . AAV1 1 nucleic acid sequences of the invention include the DNA sequences illustrated in 

Fig. 1 [See, SEQ ID NOT 18 which spans 258 bases]. The AAV 12 nucleic acid sequences of 
the invention include the DNA sequences illustrated in Fig. 1 [See, SEQ ID NO: 11 9, which 
consists of 255 bases ]. Using the methodology described above, further AAV10, AAV1 1 
and AAV 12 sequences can be readily identified and used for a variety of purposes, including 

15 those described for AAV7 and the other novel serotypes herein. 

Figure 1 provides the non-human primate (NHP) AAV nucleic acid 
sequences of the invention in an alignment with the previously published AAV serotypes, 
AAV 1 [SEQ IDNO:6], AAV2 [SEQ ID NO:7], and AAV3 [SEQ ID NO:8], These novel 
NHP sequences include those provided in the following Table I, which are identified by 

20 clone number: 

Table 1 



AAV Cap 
Sequence 


Clone 
Number 


Source 








Species 


Tissue 


SEQ ID NO 
(DNA) . 


Rh.l 


Clone 9 
(AAV9) 


Rhesus 


Heart 


5 


Rli.2 


Clone 43.1 


Rhesus 


MLN 


39 


Rh.3 


Clone 43.5 


Rhesus 


MLN 


40 


Rh.4 


Clone 43.12 


Rhesus 


MLN 


41 


Rh.5 


Clone 43.20 


Rhesus 


MLN 


42 


Rh.6 


Clone 43.21 


Rhesus 


MLN 


43 


Rli.7 


Clone 43.23 


Rhesus 


MLN 


44 
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Rh.8 


Clone 43.25 


Rhesus 


MLN 


45 


Rh.9 


Clone 44.1 


Rhesus 


Liver 


46 


Rh.10 


Clone 44.2 


Rhesus 


Liver 


59 


Rli.] 1 


Clone 44.5 


Rhesus 


Liver 


47 


Rh.12 


Clone 
42.1B 


Rhesus 


MLN 


30 


Rh.13 


42.2 


Rhesus 


MLN 


9 


RJi.14 


Clone 
42.3A 


Rhesus 


MLN 


32 


RJi.15 


Clone 
42.3B 


Rhesus 


MLN 


36 • 


Rli.16 


Clone 42.4 


Rhesus 


MLN 


33 


Rh.17 


Clone 
42.5A 


Rhesus 


MLN 


34 


Rh.18 


Clone 
42.5B 


Rhesus 


MLN 


29 


Rh.19 


Clone 
42.6B 


Rhesus 


MLN 


38 


Rli.20 


Clone 42.8 


Rhesus 


MLN 


27 


Rh.21 


Clone 42.10 


Rhesus 


MLN 


35 


RJi.22 


Clone 42.11 


Rhesus 


MLN 


37 


Rh.23 


Clone 42.12 


Rhesus 


MLN 


58 


Rh.24 


Clone 42.13 


Rhesus 


MLN 


31 


Rh.25 


Clone 42.15 


Rhesus 


MLN 


28 


Rh.26 


Clone 223.2 


Rhesus 


Liver 


49 


Rh.27 


Clone 223.4 


Rhesus 


Liver 


50 


Rh.28 


Clone 223.5 


Rhesus 


Liver 


51 


Rh.29 


Clone 223.6 


Rhesus 


Liver 


52 


Rh.30 


Clone 223.7 


Rhesus 


Liver 


53 


Rh.31 


Clone 
223.10 


Rhesus 


Liver 


48 


Rli.32 


Clone CI 


Rhesus 


Spleen, Duo, 
Kid & Liver 


19 


Rh.33 


Clone C3 


Rhesus 




20 


Rh.34 


Clone C5 


Rhesus 




21 


Rh.35 


Clone Fl 


Rhesus 


Liver 


22 


Rh.36 


Clone F3 


Rhesus 




23 


Rh.37 


Clone F5 


Rhesus 




24 


Cy.l 


Clone 1.3 


Cyno 


Blood 


14 


Cy.2 


Clone 
13.3B 


Cyno 


Blood 


15 


Cy.3 


Clone 24.1 


Cyno 


Blood 


16 


Cy.4 


Clone 27.3 


Cyno 


Blood 


17 


Cy.5 


Clone 7.2 


Cyno 


Blood 


18 


Cy.6 


Clone 16.3 


Cyno 


Blood 


10 
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Table 1 (cont'd) 



hh 1 
UD.l 




ijauoon 


JDlOOG 


1 1 


hb 2 


Clone 29 5 




Rlnnrl 

JJ 1UUU 


1 J 


Ch.l 


Clone A3.3 


Chimp 


Blood 


57 


Ch.2 


Clone A3.4 


Chimp 


Blood 


54 


Ch.3 


Clone A3.5 


Chimp 


Blood 


55 


Ch.4 


Clone A3. 7 


Chimp 


Blood 


56 



A novel NHP clone was made by splicing capsids fragments of two chimp 
adenoviruses into an AAV2 rep construct. This new clone, A3.1, is also termed Ch.5 [SEQ 
ID NO:20]. Additionally, the present invention includes two human AAV sequences, termed 

5 H6 [SEQ ID NO:25] and H2 [SEQ ID NO:26]. 

The AAV nucleic acid sequences of the invention further encompass the 
strand which is complementary to the strands provided in the sequences provided in Fig. 1 
and the Sequence Listing [SEQ ID NO;] 3 9-59, 117 -120], nucleic acid sequences, as well 
as the RNA and cDNA sequences corresponding to the sequences provided in Fig. 1 and the 

10 Sequence Listing [SEQ ID NOT, 9-59, 117-120], and their complementary strands. Also 
included in the nucleic acid sequences of the invention are natural variants and engineered 
modifications of the sequences of Figl and the Sequence Listing [SEQ ID NOT, 9 - 59, 
1 17-120], and their complementary strands. Such modifications include, for example, labels 
which are known in the art, methylation, and substitution of one or more of the naturally 

15 occurring nucleotides with a degenerate nucleotide. 

Further included in this invention are nucleic acid sequences which are 
greater than 85%, preferably at least about 90%, more preferably at least about 95%, and 
most preferably at least about 98 to 99% identical or homologous to the sequences of the 
invention, including Fig. 1 and the Sequence Listing [SEQ ID NO:l, 9 - 59, 11 7- 120]. 

20 These terms are as defined herein. 

Also included within the invention are fragments of the novel AAV 
sequences identified by the method described herein. Suitable fragments are at least 15 
nucleotides in length, and encompass functional fragments, i.e., fragments which are of 
biological interest. In one embodiment, these fragments are fragments of the novel 

25 sequences of Fig. 1 and the Sequence Listing [SEQ ID NOT, 9 - 59, 1 17-120], their 
complementary strands, cDNA and RNA complementary thereto. 

Examples of suitable fragments are provided with respect to the location of 
these fragments on AAV1, AAV2, or AAV7. However, using the alignment provided herein 
(obtained using the Clustal W program at default settings), or similar techniques for 
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generating an alignment with other novel serotypes of the invention, one of skill in the art 
can readily identify the precise nucleotide start and stop codons for desired fragments. 

Examples of suitable fragments include the sequences encoding the three 
variable proteins (vp) of the AAV capsid which are alternative splice variants: vpl [e.g., nt 

5 825 to 3049 of AAV7, SEQ ID NO: 1]; vp2 [e.g., nt 1234 - 3049 of AAV7, SEQ ID NO: J]; 
and vp 3 [e.g., nt 1434 - 3049 of AAV7, SEQ ID NO:l]. It is notable that AAV7 has an 
unusual GTG start codon. With the exception of a few house-keeping genes, such a start 
codon has not previously been reported in DNA viruses. The start codons for vpl, vp2 and 
vp3 for other AAV serotypes have been believed to be such that they permit the cellular 

10 mechanism of the host cell in which they reside to produce vpl, vp2 and vp3 in a ratio of 
10%:10%:80%, respectively, in order to permit efficient assembly of the virion. However, 
the AAV7 virion has been found to assemble efficiently even with this rare GTG start codon. 
Thus, the inventors anticipate this it is desirable to alter the start codon of the vp3 of other 
AAV serotypes to contain this rare GTG start codon, in order to improve packaging 

15 efficiency, to alter the virion structure and/or to alter location of epitopes (e.g., neutralizing 
antibody epitopes) of other AAV serotypes. The start codons may be altered using 
conventional techniques including, e.g., site directed mutagenesis. Thus, the present 
invention encompasses altered AAV virions of any selected serotype, composed of a vp 3, 
and/or optionally, vp 1 and/or vp2 having start codons altered to GTG. 

20 Other suitable fragments of AAV, include a fragment containing the start 

codon for the AAV capsid protein [e.g., nt 468 to 3090 of AAV7, SEQ ID NO:l, nt 725 to 
3090 of AAV7, SEQ ID NO: 1, and corresponding regions of the other AAV serotypes]. 
Still other fragments of AAV7 and the other novel AAV serotypes identified using the 
methods described herein include those encoding the rep proteins, including rep 78 [e.g., 

25 initiation codon 334 of Fig 1 for AAV7], rep 68 [initiation codon nt 334 of Fig. 1 for 
AAV7], rep 52 [initiation codon 1006 of Fig. 1 for AAV7], and rep 40 [initiation codon 
1006 of Fig. 1 for AAV7] Other fragments of interest may include the AAV 5' inverted 
terminal repeats ITRs, [nt 1 to 107 of Fig. 1 for AAV7]; the AAV 3' ITRs [nt 4704 to 4721 
of Fig. 1 for AAV7], P19 sequences, AAV P40 sequences, the rep binding site, and the 

30 terminal resolute site (TRS). Still other suitable fragments will be readily apparent to those 
of skill in the art. The corresponding regions in the other novel serotypes of the invention 
can be readily determined by reference to Figure 1, or by utilizing conventional alignment 
techniques with the sequences provided herein. 
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In addition to including the nucleic acid sequences provided in the figures 
and Sequence Listing, the present invention includes nucleic acid molecules and sequences' 
which are designed to express the amino acid sequences, proteins and peptides of the AAV 
serotypes of the invention. Thus, the invention includes nucleic acid sequences which 

5 encode the following novel AAV amino acid sequences: CI [SEQ ID NO:60], C2 [SEQ ID 
NO:61], C5 [SEQ ID NO:62], A3-3 [SEQ ID NO:66], A3-7 [SEQ ID NO:67], A3-4 [SEQ 
ID NO:68], A3-5 [SEQ ID NO: 69], 3.3b [SEQ ID NO: 62], 223.4 [SEQ ID NO: 73], 223-5 
[SEQ ID NO:74], 223-10 [SEQ ID NO:75], 223-2 [SEQ ID NO:76], 223-7 [SEQ ID NO: 
77], 223-6 [SEQ ID NO: 78], 44-1 [SEQ ID NO: 79], 44-5 [SEQ ID NO:80] 3 44-2 [SEQ ID 

10 NO:81], 42-15 [SEQ ID NO: 84], 42-8 [SEQ ID NO: 85], 42-13 [SEQ ID NO:86], 42-3A 
[SEQ ID NO:87], 42-4 [SEQ ID NO:88], 42-5A [SEQ ID NO:89], 42-1B [SEQ ID NO:90J, 
42-5B [SEQ ID NO:91], 43-1 [SEQ ID NO: 92], 43-12 [SEQ ID NO: 93], 43-5 [SEQ ID 
NO:94], 43-21 [SEQ ID NO:96], 43-25 [SEQ ID NO: 97], 43-20 [SEQ ID NO:99], 24. 1 
[SEQ ID NO: 101], 42.2 [SEQ ID NO:102], 7.2 [SEQ ID NO: 103], 27.3 [SEQ ID NO: 104], 

15 16.3 [SEQ ID NO: 105], 42.10 [SEQ ID NO: 106], 42-3B [SEQ ID NO: 107], 42-11 [SEQ 
ID NO: 108], Fl [SEQ ID NO: 109], F5 [SEQ ID NO: 1 10], F3 [SEQ ID NO:l 1 1], 42-6B 
[SEQ ID NO: 1 12], and/or 42-12 [SEQ ID NO: 1 13], and artificial AAV serotypes generated 
using these sequences and/or unique fragments thereof. 

As used herein, artificial AAV serotypes include, without limitation, AAV 

20 with a non-naturally occurring capsid protein. Such an artificial capsid may be generated by 
any suitable technique, using a novel AAV sequence of the invention (e.g., a fragment of a 
vpl capsid protein) in combination with heterologous sequences which may be obtained 
from another AAV serotype (known or novel), non-contiguous portions of the same AAV 
serotype, from a non-AAV viral source, or from a non-viral source. An artificial AAV 

25 serotype may be, without limitation, a chimeric AAV capsid, a recombinant AAV capsid. or 
a "humanized" AAV capsid. 

B. AAV Amino Acid Sequences, Proteins and Peptides 

The invention provides proteins and fragments thereof which are encoded by 
the nucleic acid sequences of the novel AAV serotypes identified herein, including, e.g., 

30 AAV7 [nt 825 to 3049 of AAV7, SEQ ID NO: 1] the other novel serotypes provided herein. 
Thus, the capsid proteins of the -novel serotypes of the invention, including: H6 [SEQ ID 
NO: 25], H2 [SEQ ID NO: 26], 42-2 [SEQ ID NO:9], 42-8 [SEQ ID NO:27], 42-1 5 [SEQ ID 
NO:28], 42-5b [SEQ ID NO: 29], 42-lb [SEQ ID NO:30]; 42-13 [SEQ ID NO: 3 1], 42-3a 
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[SEQ ID NO: 32], 42-4 [SEQ ID NO:33], 42-5a [SEQ ID NO: 34], 42-10 [SEQ ID NO:35], 
42-3b [SEQ ID NO: 36], 42-1 1 [SEQ ID NO: 37], 42-6b [SEQ ID NO:38], 43-1 [SEQ ID 
NO: 39], 43-5 [SEQ ID NO: 40], 43-12 [SEQ ID NO:41], 43-20 [SEQ IDNO:42], 43-21 
[SEQ ID NO: 43], 43-23 [SEQ ID NO:44], 43-25 [SEQ ID NO: 45], 44.1 [SEQ ID NO:47], 

5 44.5 [SEQ ID NO:47], 223.10 [SEQ ID NO:48], 223.2 [SEQ ID NO:49], 223.4 [SEQ ID 
NO:50], 223.5 [SEQ ID NO: 51], 223.6 [SEQ ID NO: 52], 223.7 [SEQ ID NO: 53], A3.4 
[SEQ ID NO: 54], A3.5 [SEQ ID NO:55], A3.7 [SEQ ID NO: 56], A3.3 [SEQ ID NO:57], 
42.12 [SEQ ID NO: 58], and 44.2 [SEQ ID NO: 59], can be readily generated using 
conventional techniques from the open reading frames provided for the above-listed clones. 

10 The invention further encompasses AAV serotypes generated using 

sequences of the novel AAV serotypes of the invention, which are generated using synthetic, 
recombinant or other techniques known to those of skill in the art. The invention is not 
limited to novel AAV amino acid sequences, peptides and proteins expressed from the novel 
AAV nucleic acid sequences of the invention and encompasses amino acid sequences, 

15 peptides and proteins generated by other methods known in the art, including, e.g., by 
chemical synthesis, by other synthetic techniques, or by other methods. For example, the 
sequences of any of CI [SEQ ID NO:60], C2 [SEQ ID NO:61], C5 [SEQ ID NO:62], A3-3 
[SEQ ID NO:66], A3-7 [SEQ ID NO:67], A3-4 [SEQ ID NO:68], A3-5 [SEQ ID NO: 69], 
3.3b [SEQ ID NO: 62], 223.4 [SEQ ID NO: 73], 223-5 [SEQ ID NO:74], 223-10 [SEQ ID 

20 NO:75], 223-2 [SEQ ID NO:76], 223-7 [SEQ ID NO: 77], 223-6 [SEQ ID NO: 78], 44-1 
[SEQ ID NO: 79], 44-5 [SEQ ID NO:80], 44-2 [SEQ ID NO:8I], 42-15 [SEQ ID NO: 84], 
42-8 [SEQ ID NO: 85], 42-13 [SEQ ID NO:86], 42-3A [SEQ ID NO:87], 42-4 [SEQ ID 
NO:88], 42-5A [SEQ ID NO:89], 42-1B [SEQ ID NO:90], 42-5B [SEQ ID NO:91], 43-1 
[SEQ ID NO: 92], 43-12 [SEQ ID NO: 93], 43-5 [SEQ ID NO:94], 43-21 [SEQ ID NO:96], 

25 43-25 [SEQ ID NO: 97], 43-20 [SEQ ID NO:99], 24.1 [SEQ ID NO: 101], 42.2 [SEQ ID 
NO:102],7.2 [SEQ ID NO: 103], 27.3 [SEQ ID NO: 104], 16.3 [SEQ ID NO: 105], 42.10 
[SEQ ID NO: 106], 42-3B [SEQ ID NO: 107], 42-11 [SEQ ID NO: 108], Fl [SEQ ID NO: 
109], F5 [SEQ ID NO: 110], F3 [SEQ ID NO:l 11], 42-6B [SEQ ID NO: 1 12], and/or 42-12 
[SEQ ID NO: 1 13] by be readily generated using a variety of techniques. 

30 Suitable production techniques are well known to those of skill in the art. 

See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Press (Cold Spring Harbor, NY). Alternatively, peptides can also be synthesized by the well 
known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc, 85:2149 
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(1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) 
pp. 27-62). These and other suitable production methods are within the knowledge of those 
of skill in the art and are not a limitation of the present invention. 

Particularly desirable proteins include the AAV capsid proteins, which are 

5 encoded by the nucleotide sequences identified above. The sequences of many of the capsid 
proteins of the invention are provided in an alignment in Fig. 2 and/or in the Sequence 
Listing. SEQ ID NO: 2 and 60 to 115, which is incorporated by reference herein. The AAV 
capsid is composed of three proteins, vpl, vp2 and vp3, which are alternative splice variants. 
The full-length sequence provided in these figures is that of vpl . Based on the numbering of 

0 the AAV7 capsid [SEQ ID NO:2], the sequences of vp2 span amino acid 138 - 737 of AAV7 
and the sequences of vp3 span amino acids 203 - 737 of AA V7. With this information, one 
of skill in the art can readily determine the location of the vp2 and vp3 proteins for the other 
novel serotypes of the invention. 

Other desirable proteins and fragments of the capsid protein include the 

5 constant and variable regions, located between hypervariable regions (HPV) and the 

sequences of the HPV regions themselves. An algorithm developed to determine areas of 
sequence divergence in AAV2 has yielded 12 hypervariable regions (HVR) of which 5 
overlap or are part of the four previously described variable regions. [Chiorini et al, J. Virol, 
73:1309-19 (1999); Rutledge etal, J. Virol, 72:309-319] Using this algorithm and/or the 

0 alignment techniques described herein, the HVR of the novel AAV serotypes are determined. 
For example, with respect to the number of the AAV2 vpl [SEQ ID NO:70], the HVR are 
located as follows: HVR1, aa 146-152; HVR2, aa 182-186; HVR3, aa 262-264; HVR4, aa 
381-383; HVR5, aa 450-474; HVR6, aa 490-495; HVR7, aa500-504; HVR8, aa 5 14-522; 
HVR9, aa 534-555; HVR10, aa 581-594; HVR1 1, aa 658-667; and HVR12, aa 705-719. 

5 Utilizing an alignment prepared in accordance with conventional methods and the novel 

sequences provided herein [See, e.g., Figure 2], one can readily determine the location of the 
HVR in the novel AAV serotypes of the invention. For example, utilizing Figure 2, one can 
readily determine that for AAV7 [SEQ ID NO:2]. HVR1 is located at aa 146 - 1 52; HVR2 is 
located at 182-187; HVR3 is located at aa 263-266, HVR4 is located at aa 383-385, HVR5 is 

0 located at aa 451-475; HVR6 is located at aa 491-496 of AAV7; HVR7 is located at aa 501- 
505; HVR8 is located at aa 513-521; HVR9 is located at 533-554; HVR10 is located at aa 
583-596; HVR1 1 is located at aa 660-669; HVR 12 is located at aa 707-721 . Using the 
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information provided herein, the HVRs for the other novel serotypes of the invention can be 
readily determined. 

In addition, within the capsid, amino acid cassettes of identity have been 
identified. These cassettes are of particular interest, as they are useful in constructing 

5 artificial serotypes, e.g., by replacing a HVR1 cassette of a selected serotype with an HVR1 
cassette of another serotype. Certain of these cassettes of identity are noted in Fig, 2. See, 
Fig. 2, providing the Clustal X alignment, which has a ruler is displayed below the 
sequences, starting at 1 for the first residue position. The line above the ruler is used to mark 
strongly conserved positions. Three characters (*, : , .) are used. "*" indicates positions 

10 which have a single, fully conserved residue. ":" indicates that a "strong" group is fully 
conserved "." Indicates that a "weaker" group is fully conserved. These are all the 
positively scoring groups that occur in the Gonnet Pam250 matrix. The strong groups are 
defined as a strong score >0.5 and the weak groups are defined as weak score <0.5. 

Additionally, examples of other suitable fragments of AAV capsids include, 

15 with respect to the numbering of AAV2 [SEQ IDNO:70], aa24-42, aa25 -28; aa 81 - 85; 
aal33-165; aa 134 - 165; aa 137-143; aa 154-156; aa 194-208; aa 261-274; aa 262-274; aa 
171-173; aa 413-417; aa 449-478; aa 494-525; aa 534-571 ; aa 58 1-601; aa 660-671 ; aa 709- 
723. Still other desirable fragments include, for example, in AAV7, amino acids 1 to 184 of 
SEQ ID NO:2, amino acids 199 to 259; amino acids 274 to 446; amino acids 603 to 659; 

20 amino acids 670 to 706; amino acids 724 to 736; aa 185 to 198; aa 260 to 273; aa447 to 477; 
aa495 to 602; aa660 to 669; and aa707 to 723/ Still other desirable regions, based on the 
numbering of AAV7 [SEQ ID NO:2], are selected from among the group consisting of aa 
185 to 198; aa 260 to 273; aa447 to 477;aa495 to 602; aa660 to 669; and aa707 to 723. 
Using the alignment provided herein performed using the Clustal X program at default 

25 settings, or using other commercially or publicly available alignment programs at default 
settings, one of skill in the art can readily determine corresponding fragments of the novel 
AAV capsids of the invention. 

Other desirable proteins are the AAV rep proteins [aa 1 to 623 of SEQ ID 
NO:3 for AAV7] and functional fragments thereof, including, e.g., aa 1 to 171, aa 172 to 

30 372, aa 373 to 444, aa 445 to 623 of SEQ ID NO:3, among others. Suitably, such fragments 
are at least 8 amino acids in length. See, Fig. 3. Comparable regions can be identified in the 
proteins of the other novel AAV of the invention, using the techniques described herein and 
those which are known in the art. In addition, fragments of other desired lengths may be 
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readily utilized. Such fragments may be produced recombinantly or by other suitable means, 
e.g., chemical synthesis. 

The sequences, proteins, and fragments of the invention may be produced by 
any suitable means, including recombinant production, chemical synthesis, or other synthetic 
5 means. Such production methods are within the knowledge of those of skill in the art and 
are not a limitation of the present invention. 

IV. Production of r AAV with novel AAV capsids 

The invention encompasses novel, wild-type AAV serotypes identified by the 

10 invention, the sequences of which wild-type AAV serotypes are free of DNA and/or cellular 
material with these viruses are associated in nature. In another aspect, the present invention 
provides molecules which utilize the novel AAV sequences of the invention, including 
fragments thereof, for production of molecules useful in delivery of a heterologous gene or 
other nucleic acid sequences to a target cell. 

15 The molecules of the invention which contain sequences of a novel AAV serotype of 

the invention include any genetic element (vector) which may be delivered to a host cell, 
e.g., naked DNA, a plasmid, phage, transposon, cosmid, episome, a protein in a non-viral 
delivery vehicle (e.g., a lipid-based carrier), virus, etc. .which transfer the sequences carried 
thereon. The selected vector may be delivered by any suitable method, including 

20 transfection, electroporation, liposome delivery, membrane fusion techniques, high velocity 
DNA-coated pellets, viral infection and protoplast fusion. The methods used to construct 
any embodiment of this invention are known to those with skill in nucleic acid manipulat ion 
and include genetic engineering, recombinant engineering, and synthetic techniques. See, 
e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, 

25 Cold Spring Harbor, NY. 

In one embodiment, the vectors of the invention contain sequences encoding a novel 
AAV capsid of the invention (e.g., AAV7 capsid, AAV 44-2 (rh.10), an AAV 10 capsid, an 
AAV1 1 capsid, an AAV12 capsid), or a fragment of one or more of these AAV capsids. 
Alternatively, the vectors may contain the capsid protein, or a fragment thereof, itself 

30 Optionally, vectors of the invention may contain sequences encoding AAV rep 

proteins. Such rep sequences may be from the same AAV serotype which is providing the 
cap sequences. Alternatively, the present invention provides vectors in which the rep 
sequences are from an AAV serotype which differs from that which is providing the cap 
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sequences. In one embodiment, the rep and cap sequences are expressed from separate 
sources (e.g., separate vectors, or a host cell and a vector). In another" embodiment, these rep 
sequences are expressed from the same source as the cap sequences. In this embodiment, the 
rep sequences may be fused in frame to cap sequences of a different AAV serotype to form a 

5 chimeric AAV vector. Optionally, the vectors of the invention further contain a minigene 
comprising a selected transgene which is flanked by AAV 5' ITR and AAV 3' ITR. 

Thus, in one embodiment, the vectors described herein contain nucleic acid 
sequences encoding an intact AAV capsid which may be from a single AAV serotype (e.g., 
AAV7 or another novel AAV). Alternatively, these vectors contain sequences encoding 

10 artificial capsids which contain one or more fragments of the AAV7 (or another novel AAV) 
capsid fused to heterologous AAV or non-AAV capsid proteins (or fragments thereof). 
These artificial capsid proteins are selected from non-contiguous portions of the AAV7 (or 
another novel AAV) capsid or from capsids of other AAV serotypes. For example, it may be 
desirable to modify the coding regions of one or more of the AAV vpl, e.g., in one or more 

15 of the hypervariable regions (i.e., HPV1-12), or vp2, and/or vp3. In another example, it may 
be desirable to alter the start codon of the vp3 protein to GTG, These modifications may be 
to increase expression, yield, and/or to improve purification in the selected expression 
systems, or for another desired purpose (e.g., to change tropism or alter neutralizing antibody 
epitopes). 

20 The vectors described herein, e.g., a plasmid, are useful for a variety of purposes, but 

are particularly well suited for use in production of a rAAV containing a capsid comprising 
AAV sequences or a fragment thereof. These vectors, including rAAV, their elements, 
construction, and uses are described in detail herein. 

In one aspect, the invention provides a method of generating a recombinant adeno- 

25 associated virus (AAV) having an AAV serotype 7 (or another novel AAV) capsid, or a 
portion thereof. Such a method involves culturing a host cell which contains a nucleic acid 
sequence encoding an adeno-associated virus (AAV) serotype 7 (or another novel AAV) 
capsid protein, or fragment thereof, as defined herein; a functional rep gene; a minigene 
composed of, at a minimum, AAV inverted terminal repeats (ITRs) and a transgene; and 

30 sufficient helper functions to permit packaging of the minigene into the AAV7 (or another 
novel AAV) capsid protein. 

The components required to be cultured in the host cell to package an AAV minigene 
in an AAV capsid may be provided to the host cell in trans. Alternatively, any one or more 
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of the required components (e.g., minigene, rep sequences, cap sequences, and/or helper 
functions) may be provided by a stable host cell which has been engineered to contain one or 
more of the required components using methods known to those of skill in the art. Most 
suitably, such a stable host cell will contain the required component(s) under the control of 

5 an inducible promoter. However, the required component(s) may be under the control of a 
constitutive promoter. Examples of suitable inducible and constitutive promoters are 
provided herein, in the discussion of regulatory elements suitable for use with the transgene. 
In still another alternative, a . selected stable host cell may contain selected component(s) 
under the control of a constitutive promoter and other selected component(s) under the 

10 control of one or more inducible promoters. For example, a stable host cell may be 

generated which is derived from 293 cells (which contain El helper functions under the 
control of a constitutive promoter), but which contains the rep and/or cap proteins under the 
control of inducible promoters. Still other stable host cells may be generated by one of skill 
in the art. 

1 5 The minigene, rep sequences, cap sequences, and helper functions required for 

producing the rAAV of the invention may be delivered to the packaging host cell in the form 
of any genetic element which transfer the sequences carried thereon. The selected genetic 
element may be delivered by any suitable method, including those described herein. The 
methods used to construct any embodiment of this invention are known to those with skill in 

20 nucleic acid manipulation and include genetic engineering, recombinant engineering, and 
synthetic techniques. See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Press, Cold Spring Harbor, NY. Similarly, methods of generating 
rAAV virions are well known and the selection of a suitable method is not a limitation on the 
present invention. See, e.g., K. Fisher et al, J. Virol.. 70:520-532 (1993) and US Patent 

25 5,478,745. 

A. The Minigene 

The minigene is composed of, at a minimum, a transgene and its 
regulatory sequences, and 5' and 3' AAV inverted terminal repeats (ITRs). It is this 
minigene which is packaged into a capsid protein and delivered to a selected host cell. 
30 1. The transgene 

The transgene is a nucleic acid sequence, heterologous to the 
vector sequences flanking the transgene, which encodes a polypeptide, protein, or other 
product, of interest. The nucleic acid coding sequence is operatively linked to regulatory 
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components in a manner which permits transgene transcription, translation, and/or 
expression in a host cell. 

The composition of the transgene sequence will depend upon 
the use to which the resulting vector will be put. For example, one type of transgene 

5 sequence includes a reporter sequence, which upon expression produces a detectable signal. 
Such reporter sequences include, without limitation, DNA sequences encoding p-lactamase, 
P-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein 
(GFP), chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins 
including, for example, CD2, CD4, CDS, the influenza hemagglutinin protein, and others 

10 well known in the art, to which high affinity antibodies directed thereto exist or can be 

produced by conventional means, and fusion proteins comprising a membrane bound protein 
appropriately fused to an antigen tag domain from, among others, hemagglutinin or Myc. 

These coding sequences, when associated with regulatory 
elements which drive their expression, provide signals detectable by conventional means, 

15 including enzymatic, radiographic, colorimetric, fluorescence or other spectrograph^ assays, 
fluorescent activating cell sorting assays and immunological assays, including enzyme linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA) and immunohistochemistry. For 
example, where the marker sequence is the LacZ gene, the presence of the vector carrying 
the signal is detected by assays for beta-galactosidase activity. Where the transgene is green 

20 fluorescent protein or luciferase, the vector carrying the signal may be measured visually by 
color or light production in a luminometer. 

However, desirably, the transgene is a non-marker sequence 
encoding a product which is useful in biology and medicine, such as proteins, peptides, 
RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA, 

25 ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA 

sequence is a sequence which extinguishes expression of a targeted nucleic acid sequence in 
the treated animal. 

The transgene may be used to correct or ameliorate gene 
deficiencies, which may include deficiencies in which normal genes are expressed at less 
30 than normal levels or deficiencies in which the functional gene product is not expressed. A 
preferred type of transgene sequence encodes a therapeutic protein or polypeptide which is 
expressed in a host cell. The invention further includes using multiple transgenes, e.g., to 
correct or ameliorate a gene defect caused by a rnulti-subunit protein. In certain situations, a 
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different transgene may be used to encode each subunit of a protein, or to encode different 
peptides or proteins. This is desirable when the size of the DNA encoding the protein 
subunit is large, e.g., for an immunoglobulin, the platelet-derived growth factor, or a 
dystrophin protein. In order for the cell to produce the multi-subunit protein, a cell is 

5 infected with the recombinant virus containing each of the different subunits. Alternatively, 
different subunits of a protein may be encoded by the same transgene. In this case, a single 
transgene includes the DNA encoding each of the subunits, with the DNA for each subunit 
separated by an internal ribozyme entry site (IRES). This is desirable when the size of the 
DNA encoding each of the subunits is small, e.g., the total size of the DNA encoding the 

10 subunits and the IRES is less than five kilobases. As an alternative to an IRES, the DNA may 
be separated by sequences encoding a 2A peptide, which self-cleaves in a post-translational 
event. See, e.g., ML. Donnelly, et a!, J. Gen, Virol, 78(Pt 1): 13-21 (Jan 1997); Furler, S., et 
al, Gene Then, 8(1 1):864-873 (June 2001); Klump H, et aL, Gene Ther., 8(10):81 1-817 
(May 2001). This 2A peptide is significantly smaller than an IRES, making it well suited for 

15 use when space is a limiting factor. However, the selected transgene may encode any 
biologically active product or other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill in 
the art. The selection of the transgene is not considered to be a limitation of this invention. 

2. Regulatory Elements 

20 In addition to the major elements identified above for the 

minigene, the vector also includes conventional control elements necessary which are 
operably linked to the transgene in a manner which permits its transcription, translation 
and/or expression in a cell transfected with the plasmid vector or infected with the virus 
produced by the invention. As used herein, "operably linked" sequences include both 

25 expression control sequences that are contiguous with the gene of interest and expression 
control sequences that act in trans or at a distance to control the gene of interest. 

Expression control sequences include appropriate 
transcription initiation, termination, promoter and enhancer sequences; efficient RNA 
processing signals such as splicing and polyadenylation (polyA) signals; sequences that 

30 stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak 

consensus sequence); sequences that enhance protein stability; and when desired, sequences 
that enhance secretion of the encoded product. A great number of expression control 
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sequences, including promoters which are native, constitutive, inducible and/or tissue- 
specific, are known in the art and may be utilized. 

Examples of constitutive promoters include, without 
limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV 
5 enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, 
e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase 
promoter, the p-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EFlct 
promoter [Invitrogen]. 

Inducible promoters allow regulation of gene expression 

1 0 and can be regulated by exogenously supplied compounds, environmental factors such as 
temperature, or the presence of a specific physiological state, e.g., acute phase, a 
particular differentiation state of the cell, or in replicating cells only. Inducible 
promoters and inducible systems are available from a variety of commercial sources, 
including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have 

1 5 been described and can be readily selected by one of skill in the art. Examples of 

inducible promoters regulated by exogenously supplied promoters include the zinc-inducible 
. sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary 
tumor virus (MMTV) promoter, the T7 polymerase promoter system [WO 98/10088]; the 
ecdysone insect promoter [No et al, Proc. Natl Acad. Sci. USA, 93:3346-3351 (1996)], the 

20 tetracycline-repressible system [Gossen et al, Proc. Natl Acad Sci. USA, 89:5547-5551 

(1992)], the tetracycline-inducible system [Gossen et al y Science, 268:1766-1769 (1995), see 
also Harvey et al, Curr. Opin. Chem. Biol, 2:512-518 (1998)], the RU486-inducibie system 
[Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 
(1997)] and the rapamycin-inducible system [Magari et al, J. Clin. Invest, 100:2865-2872 

25 (1997)]. Still other types of inducible promoters which may be useful in this context are 
those which are regulated by a specific physiological state, e.g., temperature, acute phase, a 
particular differentiation state of the cell, or in replicating cells only. 

In another embodiment, the native promoter for the transgene 
will be used. The native promoter may be preferred when it is desired that expression of the 

30 transgene should mimic the native expression. The native promoter may be used when 

expression of the transgene must be regulated temporally or developmental ly, or in a tissue- 
specific manner, or in response to specific transcriptional stimuli. In a further embodiment, 
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other native expression control elements, such as enhancer elements, polyadenylation sites or 
Kozak consensus sequences may also be used to mimic the native expression. 

Another embodiment of the transgene includes a transgene 
operably linked to a tissue-specific promoter. For instance, if expression in skeletal muscle 

5 is desired, a promoter active in muscle should be used. These include the promoters from 
genes encoding skeletal P-actin, myosin light chain 2A, dystrophin, muscle creatine kinase, 
as well as synthetic muscle promoters with activities higher than naturally-occurring 
promoters (see Li et al, Nat. Biotech., 17:241-245 (1999)). Examples of promoters that are 
tissue-specific are known for liver (albumin, Miyatake et al, J. Virol, 71:5124-32 (1997); 

10 hepatitis B virus core promoter, Sandig et al, Gene Ther., 3:1002-9 (1996); 

alpha-fetoprotein (AFP), Arbuthnot etal, Hum, Gene Ther., 7:1503-14 (1996)), bone 
osteocalcin (Stein et al, Mol Biol Rep., 24:185-96 (1997)); bone sialoprotein (Chen et al, 
J. Bone Miner. Res., 11:654-64 (1996)), lymphocytes (CD2, Hansal et al, J. Immunol, 
161:1063-8 (1998); immunoglobulin heavy chain; T cell receptor a chain), neuronal such as 

15 neuron-specific enolase (NSE) promoter (Andersen et al, Cell Mol Nenrobiol, 13:503-1 5 
(1993)), neurofilament light-chain gene (Piccioli etal, Proc. Natl Acad. Sci. USA, 
88:561 1-5 (1991)), and the neuron-specific vgf gene (Piccioli et al, Neuron, 15:373-84 
(1995)), among others. 

Optionally, plasmids carrying therapeutically useful 
20 transgenes may also include selectable markers or reporter genes may include sequences 
encoding geneticin, hygromicin or purimycin resistance, among others. Such selectable 
reporters or marker genes (preferably located outside the viral genome to be rescued by the 
method of the invention) can be used to signal the presence of the plasmids in bacterial cells, 
such as ampicillin resistance. Other components of the plasmid may include an origin of 
25 replication. Selection of these and other promoters and vector elements are conventional and 
many such sequences are available [see, e.g., Sambrook et al, and references cited therein]. 

The combination of the transgene, promoter/enhancer, and 5' 
and 3' ITRs is referred to as a "minigene" for ease of reference herein. Provided with the 
teachings of this invention, the design of such a minigene can be made by resort to 
30 conventional techniques. 

3. Delivery of the Minigene to a Packaging Host Cell 

The minigene can be carried on any suitable vector, e.g., a 
plasmid, which is delivered to a host cell. The plasmids useful in this invention may be 
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engineered such that they are suitable for replication and, optionally, integration in 
prokaryotic cells, mammalian cells, or both. These plasmids (or other vectors carrying the 
5' AAV ITR-heterologous molecuIe-3'ITR) contain sequences permitting replication of the 
minigene in eulcaryotes and/or prokaryotes and selection markers for these systems. 

5 Selectable markers or reporter genes may include sequences encoding geneticin, hygromicin 
or purimycin resistance, among others. The plasmids may also contain certain selectable 
reporters or marker genes that can be used to signal the presence of the vector in bacterial 
cells, such as ampicillin resistance. Other components of the plasmid may include an origin 
of replication and an ampiicon, such as the amplicon system employing the Epstein Barr 

10 virus nuclear antigen. This amplicon system, or other similar amplicon components permit 
high copy episomal replication in the cells. Preferably, the molecule carrying the minigene 
is transfected into the cell, where it may exist transiently. Alternatively, the minigene 
(carrying the 5' AAV ITR-heterologous moIecuIe-3' ITR) may be stably integrated into the 
genome of the host cell, either chromosomally or as an episome. In certain embodiments, 

15 the minigene may be present in multiple copies, optionally in head-to-head, head-to-tail, or 
tail-to-tail concatamers. Suitable transfection techniques are known and may readily be 
utilized to deliver the minigene to the host cell. 

Generally, when delivering the vector comprising the minigene by 
transfection, the vector is delivered in an amount from about 5 |ig to about 100 fig DNA, and 

20 preferably about 10 to about 50 (ig DNA to about 1 x 10 4 cells to about 1 x 10 13 cells, and 
preferably about 10^ cells. However, the relative amounts of vector DNA to host cells may 
be adjusted, taking into consideration such factors as the selected vector, the delivery method 
and the host cells selected. 

B. ■ Rep and Cap Sequences 

25 In addition to the minigene, the host cell contains the sequences 

which drive expression of the novel AAV capsid protein (e.g., AAV7 or other novel AAV 
capsid or an artificial capsid protein comprising a fragment of one or more of these capsids) 
in the host cell and rep sequences of the same serotype as the serotype of the AAV ITRs 
found in the minigene. The AAV cap and rep sequences may be independently obtained 

30 from an AAV source as described above and may be introduced into the host cell in any 
manner known to one in the art as described above. Additionally, when pseudotyping a 
novel AAV capsid of the invention, the sequences encoding each of the essential rep proteins 
may be supplied by the same AAV serotype, or the sequences encoding the rep proteins may 
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be supplied by different AAV serotypes (e.g., AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, 
or one of the novel serotypes identified herein). For example, the rep!8/68 sequences may 
be from AAV2, whereas the rep52/40 sequences may from AAV1 . 

In one embodiment, the host cell stably contains the capsid protein 

5 under the control of a suitable promoter, such as those described above. Most desirably, in 
this embodiment, the capsid protein is expressed under the control of an inducible promoter. 
In another embodiment, the capsid protein is supplied to the host cell in trans. When 
delivered to the host cell in trans, the capsid protein may be delivered via a plasmid which 
contains the sequences necessary to direct expression of the selected capsid protein in the 

10 host cell. Most desirably, when delivered to the host cell in trans, the plasmid carrying the 
capsid protein also carries other sequences required for packaging the rAAV, e.g., the rep 
sequences. 

In another embodiment, the host cell stably contains the rep 
sequences under the control of a suitable promoter, such as those described above. Most 

15 desirably, in this embodiment, the essential rep proteins are expressed under the control of an 
inducible promoter. In another embodiment, the rep proteins are supplied to the host cell in 
tram. When delivered to the host cell in trans, the rep proteins may be delivered via a 
plasmid which contains the sequences necessary to direct expression of the selected rep 
proteins in the host cell. Most desirably, when delivered to the host cell in frans, the plasmid 

20 carrying the capsid protein also carries other sequences required for packaging the rAAV, 
e.g., the rep and cap sequences. 

Thus, in one embodiment, the rep and cap sequences may be 
transfected into the host cell on a single nucleic acid molecule and exist stably in the cell as 
an episome. In another embodiment, the rep and cap sequences are stably integrated into the 

25 genome of the cell. Another embodiment has the rep and cap sequences transiently 

expressed in the host cell. For example, a useful nucleic acid molecule for such transfection 
comprises, from 5' to 3', a promoter, an optional spacer interposed between the promoter and 
the start site of the rep gene sequence, an AAV rep gene sequence, and an AAV cap gene 
sequence. 

30 Optionally, the rep and/or cap sequences may be supplied on a vector 

that contains other DNA sequences that are to be introduced into the host cells. For instance, 
the vector may contain the rAAV construct comprising the minigene. The vector may 
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comprise one or more of the genes encoding the helper functions, e.g., the adenoviral 
proteins El , E2a, and E40RF6, and the gene for VAI RNA. 

Preferably, the promoter used in this construct may be any of the 
constitutive, inducible or native promoters known to one of skill in the art or as discussed 

5 above. In one embodiment, an AAV P5 promoter sequence is employed. The selection of 
the AAV to provide any of these sequences does not limit the invention. 

In another preferred embodiment, the promoter for rep is an inducible 
promoter, as are discussed above in connection with the transgene regulatory elements. One 
preferred promoter for rep expression is the T7 promoter. The vector comprising the rep 

iO gene regulated by the T7 promoter and the cap gene, is transfected or transformed into a cell 
which either constitutively or inducibly expresses the T7 polymerase. See WO 98/1 0088. 
published March 12, 1998. 

The spacer is an optional element in the design of the vector. The 
spacer is a DNA sequence interposed between the promoter and the rep gene ATG start site. 

15 The spacer may have any desired design; that is, it may be a random sequence of nucleotides, 
or alternatively, it may encode a gene product, such as a marker gene. The spacer may 
contain genes which typically incorporate start/stop and polyA sites. The spacer may be a 
non-coding DNA sequence from a prokaryote or eukaryote, a repetitive non-coding 
sequence, a coding sequence without transcriptional controls or a coding sequence with 

20 transcriptional controls. Two exemplary sources of spacer sequences are the X phage ladder 
sequences or yeast ladder sequences, which are available commercially, e.g., from Gibco or 
Invitrogen, among others. The spacer may be of any size sufficient to reduce expression of 
the rep78 and rep68 gene products, leaving the rep52, rep40 and cap gene products 
expressed at normal levels. The length of the spacer may therefore range from about 10 bp 

25 to about 10.0 kbp, preferably in the r ange of about 100 bp to about 8.0 kbp. To reduce the 
possibility of recombination, the spacer is preferably less than 2 kbp in length; however, the 
invention is not so limited. 

Although the molecule(s) providing rep and cap may exist in the host 
cell transiently (i.e., through transfection), it is preferred that one or both of the rep and cap 

30 proteins and the promoter(s) controlling their expression be stably expressed in the host cell, 
e.g., as an episome or by integration into the chromosome of the host cell. The methods 
employed for constructing embodiments of this invention are conventional genetic 
engineering or recombinant engineering techniques such as those described in the references 
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above. While this specification provides illustrative examples of specific constructs, using 
the information provided herein, one of skill in the art may select and design other suitable 
constructs, using a choice of spacers, P5 promoters, and other elements, including at least 
one translational start and stop signal, and the optional addition of polyadenylation sites. 
5 In another embodiment of this invention, the rep or cap protein may 

be provided stably by a host cell. 

C. The Helper Functions 

The packaging host cell also requires helper functions in order to 
package the rAAV of the invention. Optionally, these functions may be supplied by a 

10 herpesvirus. Most desirably, the necessary helper functions are each provided from a human 
or non-human primate adenovirus source, such as those described above and/or are available 
from a variety of sources, including the American Type Culture Collection (ATCC), 
Manassas, VA (US). In one currently preferred embodiment, the host cell is provided with 
and/or contains an El a gene product, an Elb gene product, an E2a gene product, and/or an 

15 E4 ORF6 gene product. The host cell may contain other adenoviral genes such as VAI 
RNA, but these genes are not required. In a preferred embodiment, no other adenovirus 
genes or gene functions are present in the host cell. 

By "adenoviral DNA which expresses the El a gene product", it is 
meant any adenovirus sequence encoding Ela or any functional El a portion. Adenoviral 

20 DNA which expresses the E2a gene product and adenoviral DNA which expresses the E4 
ORF6 gene products are defined similarly. Also included are any alleles or other 
modifications of the adenoviral gene or functional portion thereof. Such modifications may 
be deliberately introduced by resort to conventional genetic engineering or mutagenic 
techniques to enhance the adenoviral function in some manner, as well as naturally occurring 

25 allelic variants thereof. Such modifications and methods for manipulating DNA to achieve 
these adenovirus gene functions are known to those of skill in the art. 

The adenovirus Ela, Elb, E2a, and/or E40RF6 gene products, as 
well as any other desired helper functions, can be provided using any means that allows their 
expression in a cell. Each of the sequences encoding these products may be on a separate 

30 vector, or one or more genes may be on the same vector. The vector may be any vector 

known in the art or disclosed above, including plasmids, cosmids and viruses. Introduction 
into the host cell of the vector may be achieved by any means known in the art or as 
disclosed above, including transfection, infection, electroporation, liposome delivery, 
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membrane fusion techniques, high velocity DNA-coated pellets, viral infection and 
protoplast fusion, among others. One or more of the adenoviral genes may be stably 
integrated into the genome of the host cell, stably expressed as episomes, or expressed 
transiently. The gene products may all be expressed transiently, on an episome or stably 

5 integrated, or some of the gene products may be expressed stably while others are expressed 
transiently. Furthermore, the promoters for each of the adenoviral genes may be selected 
independently from a constitutive promoter, an inducible promoter or a native adenoviral 
promoter. The promoters may be regulated by a specific physiological state of the organism 
or cell (i.e., by the differentiation state or in replicating or quiescent cells) or by 

10 exogenously-added factors, for example. 

D. Host Cells And Packaging Cell Lines 

The host cell itself may be selected from any biological organism, including 
prolcaryotic (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and 
mammalian cells. Particularly desirable host cells are selected from among any mammalian 

15 species, including, without limitation, cells such as A549, WEHI, 3T3, 10T1/2, BHK, 

MDCK, COS 1, COS 7, BSC 1, BSC 40, BMT 10, VERO, WI38, HeLa, 293 cells (which 
express functional adenoviral El), Saos, C2C12, L cells, HT1080, HepG2 and primary 
fibroblast, hepatocyte and myoblast cells derived from mammals including human, monkey, 
mouse, rat, rabbit, and hamster. The selection of the mammalian species providing the cells 

20 is not a limitation of this invention; nor is the type of mammalian cell, i.e., fibroblast, 

hepatocyte, tumor cell, etc. The most desirable cells do not carry any adenovirus gene other 
than El, E2a and/or E4 ORF6; nor do they contain any other virus gene which could result in 
homologous recombination of a contaminating virus during the production of rAAV; and it 
is capable of infection or transfection of DNA and expression of the transfected DNA. In a 

25 preferred embodiment, the host cell is one that has rep and cap stably transfected in the cell. 

One host cell useful in the present invention is a host cell stably 
transformed with the sequences encoding rep and cap, and which is transfected with the 
adenovirus El, E2a, and E40RF6 DNA and a construct carrying the minigene as described 
above. Stable rep and/or cap expressing cell lines, such as B-50 (PCT/US98/19463), or those 

30 described in U.S. Patent No. 5,658,785, may also be similarly employed. Another desirable 
host cell contains the minimum adenoviral DNA which is sufficient to express E4 ORF6. 
Yet other cell lines can be constructed using the novel AAV rep and/or novel AAV cap 
sequences of the invention. 
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The preparation of a host cell according to this invention involves 
techniques such as assembly of selected DNA sequences. This assembly may be 
accomplished utilizing conventional techniques. Such techniques include cDNA and 
genomic cloning, which are well known and are described in Sambrook et ah, cited above, 
5 use of overlapping oligonucleotide sequences of the adenovirus and AAV genomes, 
combined with polymerase chain reaction, synthetic methods, and any other suitable 
methods which provide the desired nucleotide sequence. 

Introduction of the molecules (as plasmids or viruses) into the host 
cell may also be accomplished using techniques known to the skilled artisan and as discussed 

10 throughout the specification. In preferred embodiment, standard transfection techniques are 
used, e.g., CaP0 4 transfection or electroporation, and/or infection by hybrid 
adenovirus/AAV vectors into cell lines such as the human embryonic kidney cell line HEK 
293 (a human kidney cell line containing functional adenovirus El genes which provides 
trans-acting El proteins). 

15 These novel AAV-based vectors which are generated by one of skill 

in the art are beneficial for gene delivery to selected host cells and gene therapy patients 
since no neutralization antibodies to AAV7 have been found in the human population. 
Further, early studies show no neutralizing antibodies in cyno monkey and chimpanzee 
populations, and less than 15% cross-reactivity of AAV 7 in rhesus monkeys, the species 

20 from which the serotype was isolated. One of skill in the art may readily prepare other 

rAAV viral vectors containing the AAV7 capsid proteins provided herein using a variety of 
techniques known to those of skill in the art. One may similarly prepare still other rAAV 
viral vectors containing AAV7 sequence and AAV capsids of another serotype. Similar 
advantages are conferred by the vectors based on the other novel AAV of the invention. 

25 Thus, one of skill in the art will readily understand that the AAV7 

sequences of the invention can be readily adapted for use in these and other viral vector 
systems for in vifro, ex vivo or in vivo gene delivery. Similarly, one of skill in the art can 
readily select other fragments of the novel AAV genome of the invention for use in a variety 
of rAAV and non-rAAV vector systems. Such vectors systems may include, e.g., 

30 Antiviruses, retroviruses, poxviruses, vaccinia viruses, and adenoviral systems, among 
others. Selection of these vector'systems is not a limitation of the present invention. 

Thus, the invention further provides vectors generated using the 
nucleic acid and amino acid sequences of the novel AAV of the invention. Such vectors are 
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useful for a variety of purposes, including for delivery of therapeutic molecules and for use 
in vaccine regimens. Particularly desirable for delivery of therapeutic molecules are 
recombinant AAV containing capsids of the novel AAV of the invention. These, or other 
vector constructs containing novel AAV sequences of the invention may be used in vaccine 
regimens, e.g., for co-delivery of a cytokine, or for delivery of the immunogen itself. 

V . Recombinant V iruses And Uses Thereof 

Using the techniques described herein, one of skill in the art may generate a rAAV 
having a capsid of a novel serotype of the invention, or a novel capsid containing one or 
more novel fragments of an AAV serotype identified by the method of the invention. In one 
embodiment, a full-length capsid from a single serotype, e.g., AAV7 [SEQ ID NO: 2] can be 
utilized. In another embodiment, a full-length capsid may be generated which contains one 
or more fragments of a novel serotype of the invention fused in frame with sequences from 
another selected AAV serotype. For example, a rAAV may contain one or more of the novel 
hypervariable region sequences of an AAV serotype of the invention. Alternatively, the 
unique AAV serotypes of the invention may be used in constructs containing other viral or 
non-viral sequences. 

It will be readily apparent to one of skill in the art one embodiment, that certain 
serotypes of the invention will be particularly well suited for certain uses. For example, 
vectors based on AAV7 capsids of the invention are particularly well suited for use in 
muscle; whereas vectors based on rh.10 (44-2) capsids of the invention are particularly well 
suited for use in lung. Uses of such vectors are not so limited and one of skill in the art may 
utilize these vectors for delivery to other cell types, tissues or organs. Further, vectors based 
upon other capsids of the invention may be used for delivery to these or other cells, tissues or 
organs. 

A. Delivery of Transgene 

In another aspect, the present invention provides a method for deliver)' of a 
transgene to a host which involves transfecting or infecting a selected host cell with a vector 
generated with the sequences of the AAV of the invention. Methods for delivery are well 
known to those of skill in the art and are not a limitation of the present invention. 

In one desirable embodiment, the invention provides a method for AAV- 
mediated delivery of a transgene to a host. This method involves transfecting or infecting a 
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selected host cell with a recombinant viral vector containing a selected transgene under the 
control of sequences which direct expression thereof and AAV capsid proteins. 

Optionally, a sample from the host may be first assayed for the presence of 
antibodies to a selected AAV serotype. A variety of assay formats for detecting neutralizing 

5 antibodies are well known to those of skill in the art. The selection of such an assay is not a 
limitation of the present invention. See, e.g., Fisher et al, Native Med., 3(3):306-312 (March 
1997) and W. C. Manning et al, Human Gene Therapy, 9:477-485 (March 1, 1998). The 
results of this assay may be used to determine which AAV vector containing capsid proteins 
of a particular serotype are preferred for delivery, e.g., by the absence of neutralizing 

10 antibodies specific for that capsid serotype. 

In one aspect of this method, the delivery of vector with a selected AAV 
capsid proteins may precede or follow delivery of a gene via a vector with a different 
serotype AAV capsid protein. Similarly, the delivery of vector with other novel AA V capsid 
proteins of the invention may precede or follow delivery of a gene via a vector with a 

15 different serotype AAV capsid protein. Thus, gene delivery via rAAV vectors may be used 
for repeat gene delivery to a selected host cell. Desirably, subsequently administered rAAV 
vectors carry the same transgene as the first rAAV vector, but the subsequently administered 
vectors contain capsid proteins of serotypes which differ from the first vector. For example, 
if a first vector has AAV7 capsid proteins [SEQ ID NO:2] 3 subsequently administered 

20 vectors may have capsid proteins selected from among the other serotypes, including AAV1 , 
AAV2, AAV3A, AAV3B, AAV4, AAV6, AAV10, AAV1 1, and AAV12, or any of the other 
novel AAV capsids identified herein including, without limitation: A3.1, H2, H6, CI, C2, 
C5, A3-3, A3-7, A3-4, A3-5, 3.3b, 223.4, 223-5, 223-10, 223-2, 223-7, 223-6, 44-1, 44-5, 
44-2, 42-15, 42-8, 42-13, 42-3A, 42-4, 42-5A, 42-1B, 42-5B, 43-1, 43-12, 43-5, 43-21, 43- 

25 25, 43-20, 24.1, 42.2, 7.2, 27.3, 16.3, 42.10, 42-3B, 42-11, Fl, F5, F3, 42-6B, and/or 42-12. 

The above-described recombinant vectors may be delivered to host cells 
according to published methods. The rAAV, preferably suspended in a physiologically 
compatible carrier, may be administered to a human or non-human mammalian patient. 
Suitable carriers may be readily selected by one of skill in the art in view of the indication 

30 for which the transfer virus is directed. For example, one suitable carrier includes saline, 
which may be formulated with a variety of buffering solutions (e.g., phosphate buffered 
saline). Other exemplary carriers include sterile saline, lactose, sucrose, calcium phosphate, 
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gelatin, dextran, agar, pectin, peanut oil, sesame oil, and water. The selection of the carrier 
is not a limitation of the present invention. 

Optionally, the compositions of the invention may contain, in addition to the 
rAAV and carrier(s), other conventional pharmaceutical ingredients, such as preservatives, 

5 or chemical stabilizers. Suitable exemplary preservatives include chlorobutanol, potassium 
sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glycerin, 
phenol, and parachlorophenol. Suitable chemical stabilizers include gelatin and albumin. 

The viral vectors are administered in sufficient amounts to transfect the cells 
and to provide sufficient levels of gene transfer and expression to provide a therapeutic 

1 0 benefit without undue adverse effects, or with medically acceptable physiological effects, 
which can be determined by those skilled in the medical arts. Conventional and 
pharmaceutically acceptable routes of administration include, but are not limited to, direct 
delivery to the selected organ (e.g., intraportal delivery to the liver), oral, inhalation 
(including intranasal and intratracheal delivery), intraocular, intravenous, intramuscular, 

15 subcutaneous, intradermal, and other parental routes of administration. Routes of 
administration may be combined, if desired. 

Dosages of the viral vector will depend primarily on factors such as the 
condition being treated, the age, weight and health of the patient, and may thus vary among 
patients. For example, a therapeutically effective human dosage of the viral vector is 

20 generally in the range of from about 1 ml to about 100 ml of solution containing 

concentrations of from about 1 x 10 9 to 1 x 10 16 genomes virus vector. A preferred human 
dosage may be about 1 x 10 13 to 1 x 10 16 AAV genomes. The dosage will be adjusted to 
balance the therapeutic benefit against any side effects and such dosages may vary 
depending upon the therapeutic application for which the recombinant vector is employed. 

25 The levels of expression of the transgene can be monitored to determine the frequency of 
dosage resulting in viral vectors, preferably AAV vectors containing the minigene. 
Optionally, dosage regimens similar to those described for therapeutic purposes may be 
utilized for immunization using the compositions of the invention. 

Examples of therapeutic products and immunogenic products for delivery by 

30 the AAV-containing vectors of the invention are provided below. These vectors may be 
used for a variety of therapeutic or vaccinal regimens, as described herein. Additionally, 
these vectors may be delivered in combination with one or more other vectors or active 
ingredients in a desired therapeutic and/or vaccinal regimen. 
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B. Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include hormones and 
growth and differentiation factors including, without limitation, insulin, glucagon, growth 
hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor (GRF), 
follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic 
gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, angiostatin, 
granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), connective tissue 
growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic fibroblast growth factor 
(aFGF), epidermal growth factor (EGF), transforming growth factor a (TGFa), platelet- 
derived growth factor (PDGF), insulin growth factors I and II (IGF-I and IGF-II), any one of 
the transforming growth factor p superfamily, including TGF P, activins, inhibins, or any of 
the bone morphogenic proteins (BMP) BMPs 1-15, any one of the 

heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, nerve 
growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins NT-3 and 
NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic factor 
(GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 and 
netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine 
hydroxylase. 

Other useful transgene products include proteins that regulate the immune 
system including, without limitation, cytokines and lympholcines such as thrombopoietin 
(TPO), interleulcins (IL) IL-1 through IL-25 (including, IL-2, IL-4, IL-12, and IL-18), 
monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage 
colony stimulating factor, Fas ligand, tumor necrosis factors a and p, interferons a, p, and y, 
stem cell factor, flk-2/flt3 ligand. Gene products produced by the immune system are also 
useful in the invention. These include, without limitations, immunoglobulins IgG, IgM, IgA, 
IgD and IgE, chimeric immunoglobulins, humanized antibodies, single chain antibodies, T 
cell receptors, chimeric T cell receptors, single chain T cell receptors, class I and class II 
MHC molecules, as well as engineered immunoglobulins and MHC molecules. Useful gene 
products also include complement regulatory proteins such as complement regulatory ' 
proteins, membrane cofactor protein (MCP), decay accelerating factor (DAF), CR1, CF2 and 
CD59. 

Still other useful gene products include any one of the receptors for the 
hormones, growth factors, cytokines, lympholcines, regulatory proteins and immune system 
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proteins. The invention encompasses receptors for cholesterol regulation, including the low 
density lipoprotein (LDL) receptor, high density lipoprotein (HDL) receptor, the very low 
density lipoprotein (VLDL) receptor, and the scavenger receptor. The invention also 
encompasses gene products such as members of the steroid hormone receptor superfamily 
5 including glucocorticoid receptors and estrogen receptors, Vitamin D receptors and other 
nuclear receptors. In addition, useful gene products include transcription factors such as jim, 
fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD and myogenin/ETS- 
box containing proteins, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF- 
4, C/EBP, SP1, CCAAT-box binding proteins, interferon regulation factor (IRF-1), Wilms 
10 tumor protein, ETS-binding protein, STAT, GATA-box binding proteins, e.g., GATA-3, and 
the forlchead family of winged helix proteins. 

Other useful gene products include, carbamoyl synthetase I, ornithine 
transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, 
fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-] antitrypsin, gIucose-6- 
15 phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione beta-synthase, 
branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase, propionyl 
CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA dehydrogenase, insulin, beta- 
glucosidase, pyruvate carboxylate, hepatic phosphorylase, phosphorylase kinase, glycine 
decarboxylase, H-protein, T-protein, a cystic fibrosis transmembrane regulator (CFTR) 
20 sequence, and a dystrophin cDNA sequence. Still other useful gene products include 

enzymes such as may be useful in enzyme replacement therapy, which is useful in a variety 
of conditions resulting from deficient activity of enzyme. For example, enzymes that contain 
mannose-6-phosphate may be utilized in therapies for lysosomal storage diseases (e.g., a 
suitable gene includes that encoding p-glucuronidase (GUSB)). 
25 Other useful gene products include non-naturally occurring polypeptides, 

such as chimeric or hybrid polypeptides having a non-naturally occurring amino acid 
sequence containing insertions, deletions or amino acid substitutions. For example, single- 
chain engineered immunoglobulins could be useful in certain immunocompromised patients. 
Other types of non-naturally occurring gene sequences include antisense molecules and 
30 catalytic nucleic acids, such as ribozymes, which could be used to reduce overexpression of a 
target. 

Reduction and/or modulation of expression of a gene is particularly desirable 
for treatment of hyperproliferative conditions characterized by hyperproliferating cells, as 
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are cancers and psoriasis. Target polypeptides include those polypeptides which are 
produced exclusively or at higher levels in hyperproliferative cells as compared to normal 
cells. Target antigens include polypeptides encoded by oncogenes such as myb, myc, fyn, 
and the translocation gene bcr/abl, ras, src, P53, neu, trk and EGRF. In addition to oncogene 

5 products as target antigens, target polypeptides for anti-cancer treatments and protective 
regimens include variable regions of antibodies made by B cell lymphomas and variable 
regions of T cell receptors of T cell lymphomas which, in some embodiments, are also used 
as target antigens for autoimmune disease. Other tumor-associated polypeptides can be used 
as target polypeptides such as polypeptides which are found at higher levels in tumor cells 

10 including the polypeptide recognized by monoclonal antibody 17-1 A and folate binding 
polypeptides. 

Other suitable therapeutic polypeptides and proteins include those which may 
be useful for treating individuals suffering from autoimmune diseases and disorders by 
conferring a broad based protective immune response against targets that are associated with 
15 autoimmunity including cell receptors and cells which produce "self-directed antibodies. T 
cell mediated autoimmune diseases include Rheumatoid arthritis (RA), multiple sclerosis 
(MS), Sjogren's syndrome, sarcoidosis, insulin dependent diabetes mellitus (IDDM), 
autoimmune thyroiditis, reactive arthritis, ankylosing spondylitis, scleroderma, polymyositis, 
dermatomyositis, psoriasis, vasculitis, Wegener's granulomatosis, Crohn's disease and 
20 ulcerative colitis. Each of these diseases is characterized by T cell receptors (TCRs) that 
bind to endogenous antigens and initiate the inflammatory cascade associated with 
autoimmune diseases. 

C. Immunogenic Transgenes 

Alternatively, or in addition, the vectors of the invention may contain AAV 
25 sequences of the invention and a transgene encoding a peptide, polypeptide or protein which 
induces an immune response to a selected immunogen. For example, immunogens may be 
selected from a variety of viral families. Example of desirable viral families against which 
an immune response would be desirable include, the picornavirus family, which includes the 
genera rhinoviruses, which are responsible for about 50% of cases of the common cold; the 
30 genera enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and human 
enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are responsible 
for foot and mouth diseases, primarily in non-human animals. Within the picornavirus 
family of viruses, target antigens include the VP1 , VP2, VP3, VP4, and VPG. Another viral 
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family includes the calcivirus family, which encompasses theNorwalk group of viruses, 
which are an important causative agent of epidemic gastroenteritis. Still another viral family 
desirable for use in targeting antigens for inducing immune responses in humans and non- 
human animals is the togavirus family, which includes the genera alphavirus, which include 
5 Sindbis viruses, RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, 
and rubivirus, including Rubella virus. The flaviviridae family includes dengue, yellow 
fever, Japanese encephalitis, St. Louis encephalitis and tick borne encephalitis viruses. Other 
target antigens may be generated from the Hepatitis C or the coronavirus family, which 
includes a number of non-human viruses such as infectious bronchitis virus (poultry), 

10 porcine transmissible gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis 
virus (pig), feline infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine 
coronavirus (dog), and human respiratory coronaviruses, which may cause the common cold 
and/or non-A, B or C hepatitis. Within the coronavirus family, target antigens include the 
El (also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called HE 

15 or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), or N (nucleocapsid). 
Still other antigens may be targeted against the rhabdovirus family, which includes the 
genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general lyssavirus (e.g., 
rabies). Within the rhabdovirus family, suitable antigens may be derived from the G protein 
or the N protein. The family filoviridae, which includes hemorrhagic fever viruses such as 

20 Marburg and Ebola virus may be a suitable source of antigens. The paramyxovirus family 
includes parainfluenza Virus Type 1, parainfluenza Virus Type 3, bovine parainfluenza 
Virus Type 3, rubulavirus (mumps virus, parainfluenza Virus Type 2, parainfluenza virus 
Type 4, Newcastle disease virus (chickens), rinderpest, morbillivirus, which includes 
measles and canine distemper, and pneumovirus, which includes respiratory syncytial virus. 

25 The influenza virus is classified within the family orthomyxovirus and is a suitable source of 
antigen (e.g., the HA protein, the Nl protein). The bunyavirus family includes the genera 
bunyavirus (California encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus 
(puremala is a hemahagin fever virus), nairovirus (Nairobi sheep disease) and various 
unassigned bungaviruses. The arenavirus family provides a source of antigens against LCM 

30 and Lassa fever virus. The reovirus family includes the genera reovirus, rotavirus (which 
causes acute gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick fever, 
Lebombo (humans), equine encephalosis, blue tongue). 



« 
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The retrovirus family includes the sub-family oncorivirinal which 
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI and 
HTLVII, lentivirinal (which includes human immunodeficiency virus (HIV), simian 
immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine infectious 
5 anemia virus, and spumavirinal). Between the HIV and SIV, many suitable antigens have 
been described and can readily be selected. Examples of suitable HIV and SIV antigens 
include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat and Rev proteins, as well 
as various fragments thereof. In addition, a variety of modifications to these antigens have 
been described. Suitable antigens for this purpose are known to those of skill in the art. For 

10 example, one may select a sequence encoding the gag, pol, Vif, and Vpr, Env, Tat and Rev, 
amongst other proteins. See, e.g., the modified gag protein which is described in US Patent 
5,972,596. See, also, the HIV and SIV proteins described in D.H. Barouch et al, J. Virol., 
75(5):2462-2467 (March 2001), and R,R. Amara, et al, Science, 292:69-74 (6 April 2001). 
These proteins or subunits thereof may be delivered alone, or in combination via separate 

15 vectors or from a single vector. 

The papovavirus family includes the sub-family polyomaviruses (BKU and 
JCU viruses) and the sub-family papillomavirus (associated with cancers or malignant 
progression of papilloma). The adenovirus family includes viruses (EX, AD7, ARD, O.B.) 
which cause respiratory disease and/or enteritis. The parvovirus family feline parvovirus 

20 (feline enteritis), feline panleucopeniavirus, canine parvovirus, and porcine parvovirus. The 
herpesvirus family includes the sub-family alphaherpesvirinae, which encompasses the 
genera simplexvirus (HSVI, HSVII), varicellovirus (pseudorabies, varicella zoster) and the 
sub-family betaherpesvirinae, which includes the genera cytomegalovirus (HCMV, 
muromegalovirus) and the sub-family gammaherpesvirinae, which includes the genera 

25 lymphocrypto virus, EBV (Burkitts lymphoma), infectious rhinotracheitis, Marek's disease 
virus, and rhadinovirus. The poxvirus family includes the sub-family chordopoxvirinae, 
which encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), 
parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family 
entomopoxvirinae. The hepadnavirus family includes the Hepatitis B virus. One 

30 unclassified virus which may be suitable source of antigens is the Hepatitis delta virus. Still 
other viral sources may include avian infectious bursal disease virus and porcine respiratory 
and reproductive syndrome virus. The alphavirus family includes equine arteritis virus and 
various Encephalitis viruses. 
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The present invention may also encompass immunogens which areuseful to 
immunize a human or non-human animal against other pathogens including bacteria, fungi, 
parasitic microorganisms or multicellular parasites which infect human and non-human 
vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens include 
5 pathogenic gram-positive cocci include pneumococci; staphylococci; and streptococci. 
Pathogenic gram-negative cocci include meningococcus; gonococcus. Pathogenic enteric 
gram-negative bacilli include enterobacteriaceae; pseudomonas, acinetobacteria and 
eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; K ducreyi (which 
causes chancroid); brucella; Franisella tidarensis (which causes tularemia); yersinia 
10 (pasteurella); streptobacillus moniliformis and spirillum; Gram-positive bacilli include 
listeria monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria 
(diphtheria); cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and 
bartonellosis. Diseases caused by pathogenic anaerobic bacteria include tetanus; botulism; 
other Clostridia; tuberculosis; leprosy; and other mycobacteria. Pathogenic spirochetal 
15 diseases include syphilis; treponematoses: yaws, pinta and endemic syphilis; and 

leptospirosis. Other infections caused by higher pathogen bacteria and pathogenic fungi 
include actinomycosis; nocardiosis; cryptococcosis, blastomycosis, histoplasmosis and 
coccidioidomycosis; candidiasis, aspergillosis, and mucormycosis; sporotrichosis; 
paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and chromomycosis; and 
20 dermatophytosis. Rickettsial infections include Typhus fever, Rocky Mountain spotted 
fever, Q fever, and Rickettsialpox. Examples of mycoplasma and chlamydial infections 
include: mycoplasma pneumoniae; lymphogranuloma venereum; psittacosis; and perinatal 
chlamydial infections. Pathogenic eukaryotes encompass pathogenic protozoans and 
helminths and infections produced thereby include: amebiasis; malaria; leishmaniasis; 
25 trypanosomiasis; toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; 
babesiosis; giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes or 
flukes; and cestode (tapeworm) infections. 

Many of these organisms and/or toxins produced thereby have been 
identified by the Centers for Disease Control [(CDC), Department of Heath and Human 
30 Services, USA], as agents which have potential for use in biological attacks. For example, 

some of these biological agents, include, Bacillus anthracis (anthrax), Clostridium botulinum 
and its toxin (botulism), Yersinia pestis (plague), variola major (smallpox), Francisella 
iularensis (tularemia), and viral hemorrhagic fever, all of which are currently classified as 
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Category A agents; Coxiella burnetii (Q fever); Brucella species (brucellosis), Burkholdeha 
mallei (glanders), Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and 
its toxin (epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), all of which 
are currently classified as Category B agents; and Nipan virus and hantaviruses, which are 
5 currently classified as Category C agents. In addition, other organisms, which are so 
classified or differently classified, may be identified and/or used for such a purpose in the 
future. It will be readily understood that the viral vectors and other constructs described 
herein are useful to deliver antigens from these organisms, viruses, their toxins or other by- 
products, which will prevent and/or treat infection or other adverse reactions with these 

10 biological agents. 

Administration of the vectors of the invention to deliver immunogens against 
the variable region of the T cells elicit an immune response including CTLs to eliminate 
those T cells. In rheumatoid arthritis (RA), several specific variable regions of T cell 
receptors (TCRs) which are involved in the disease have been characterized. These TCRs 

15 include V-3, V-14, V-17 and Va-17. Thus, delivery of a nucleic acid sequence that encodes 
at least one of these polypeptides will elicit an immune response that will target T cells 
involved in RA. In multiple sclerosis (MS), several specific variable regions of TCRs which 
are involved in the disease have been characterized. These TCRs include V-7 and Va-10. 
Thus, delivery of a nucleic acid sequence that encodes at least one of these polypeptides will 

20 elicit an immune response that will target T cells involved in MS. In scleroderma, several 
specific variable regions of TCRs which are involved in the disease have been characterized. 
These TCRs include V-6, V-8, V-14 and Va-16, Va-3C, Va-7, Va-14, Va-15, Va-16, 
Va-28 and Va-12. Thus, delivery of a nucleic acid molecule that encodes at least one of 
these polypeptides will elicit an immune response that will target T cells involved in 

25 scleroderma. 

Optionally, vectors containing AAV sequences of the invention may be 
delivered using a prime-boost regimen. A variety of such regimens have been described in 
the art and may be readily selected. See, e.g., WO 00/1 1 140, published March 2, 2000, 
incorporated by reference. 
30 Such prime-boost regimens typically involve the administration of a DNA 

(e.g., plasmid) based vector to prime the immune system to second, booster, administration 
with a traditional antigen, such as a protein or a recombinant virus carrying the sequences 
encoding such an antigen. In one embodiment, the invention provides a method of priming 
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and boosting an immune response to a selected antigen by delivering a plasmid DNA vector 
carrying said antigen, followed by boosting, e.g., with a vector containing AAV sequences of 
the invention. 

In one embodiment, the prime-boost regimen involves the expression of 
5 multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, Science, 
292:69-74 (6 April 2001) which describes a multiprotein regimen for expression of protein 
subunits useful for generating an immune response against HIV and SIV. For example, a 
DNA prime may deliver the Gag, Pol, Vif VPX and Vpr and Env, Tat, and Rev from a 
single transcript. Alternatively, the SIV Gag, Pol and HIV-1 Env is delivered. 

10 However, the prime-boost regimens are not limited to immunization for HIV or to 

delivery of these antigens. For example, priming may involve delivering with a first chimp 
vector of the invention followed by boosting with a second chimp vector, or with a 
composition containing the antigen itself in protein form. In one or example, the prime- 
boost regimen can provide a protective immune response to the virus, bacteria or other 

15 organism from which the antigen is derived. In another desired embodiment, the prime- 
boost regimen provides a therapeutic effect that can be measured using convention assays for 
detection of the presence of the condition for which therapy is being administered. 

The priming vaccine may be administered at various sites in the body in a 
dose dependent manner, which depends on the antigen to which the desired immune 

20 response is being targeted. The invention is not limited to the amount or situs of injection(s) 
or to the pharmaceutical carrier. Rather, the priming step encompasses treatment regimens 
which include a single dose or dosage which is administered hourly, daily, weekly or 
monthly, or yearly. As an example, the mammals may receive one or two priming injection 
containing between about 10 pg to about 50 pg of plasmid in carrier. A desirable priming 

25 amount or dosage of the priming DNA vaccine composition ranges between about 1 pg to 
about 10,000 pg of the DNA vaccine. Dosages may vary from about 1 pg to 1000 pg DNA 
per leg of subject body weight. The amount or site of injection is desirably selected based 
upon the identity and condition of the mammal being vaccinated. 

The dosage unit of the DNA vaccine suitable for delivery of the antigen to 

30 the mammal is described herein. The DNA vaccine is prepared for administration by being 
suspended or dissolved in a pharmaceutical !y or physiologically acceptable carrier such as 
isotonic saline, isotonic salts solution or other formulations which will be apparent to those 
skilled in such administration. The appropriate carrier will be evident to those skilled in the 
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art and will depend in large part upon the route of administration. The compositions of the 
invention may be administered to a mammal according to the routes described above, in a 
sustained release formulation using a biodegradable biocompatible polymer, or by on-site 
delivery using micelles, gels and liposomes. 
5 Optionally, the priming step of this invention also includes administering 

with the priming DNA vaccine composition, a suitable amount of an adjuvant, such as are 
defined herein. 

Preferably, a boosting composition is administered about 2 to about 27 weeks 
after administering the priming DNA vaccine to the mammalian subject. The administration 

10 of the boosting composition is accomplished using an effective amount of a boosting vaccine 
composition containing or capable of delivering the same antigen as administered by the 
priming DNA vaccine. The boosting composition may be composed of a recombinant viral 
vector derived from the same viral source or from another source. Alternatively, the 
"boosting composition" can be a composition containing the same antigen as encoded in the 

15 priming DNA vaccine, but in the form of a protein or peptide, which composition induces an 
immune response in the host. In another embodiment, the boosting vaccine composition 
includes a composition containing a DNA sequence encoding the antigen under the control 
of a regulatory sequence directing its expression in a mammalian cell, e.g., vectors such as 
well-known bacterial or viral vectors. The primary requirements of the boosting vaccine 

20 composition are that the antigen of the vaccine composition is the same antigen, or a cross- 
reactive antigen, as that encoded by the DNA vaccine. 

Suitably, the vectors of the invention are also well suited for use in regimens 
which use non~AAV vectors as well as proteins, peptides, and/or other biologically useful 
therapeutic or immunogenic compounds. These regimens are particularly well suited to gene 

25 delivery for therapeutic poses and for immunization, including inducing protective 
immunity. Such uses will be readily apparent to one of skill in the art. 

Further, a vector of the invention provides an efficient gene transfer vehicle 
which can deliver a selected transgene to a selected host cell in vivo or ex vivo even where 
the organism has neutralizing antibodies to one or more AAV serotypes. In one 

30 embodiment, the vector (e.g., an rAAV) and the cells are mixed ex vivo; the infected cells are 
cultured using conventional methodologies; and the transduced cells are re-infused into the 
patient. Further, the vectors of the invention may also be used for production of a desired 
gene product in vitro. For in vitro production, a desired product (e.g., a protein) may be 
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obtained from a desired culture following transfection of host cells with a rAAV containing 
the molecule encoding the desired product and culturing the cell culture under conditions 
which permit expression. The expressed product may then be purified and isolated, as 
desired. Suitable techniques for transfection, cell culturing, purification, and isolation are 
known to those of skill in the art. 

The following examples illustrate several aspects and embodiments of the invention. 

EXAMPLES 

Example 1 : PCR amplification, cloning and characterization of novel AAV sequences. 

Tissues from nonhuman primates were screened for AAV sequences using a 
PCR method based on oligonucleotides to highly conserved regions of known AAVs. A 
stretch of AAV sequence spanning 2886 to 3143 bp of AAV1 [SEQ ID NO:6] was selected 
as a PCR amplicon in which a hypervariable region of the capsid protein (Cap) that is unique 
to each known AAV serotype, which is termed herein a "signature region," is flanked by 
conserved sequences. In later analysis, this signature region was shown to be located 
between conserved residues spanning hypervariable region 3. 

An initial survey of peripheral blood of a number of nonhuman primate 
species revealed detectable AAV in a subset of animals from species such as rhesus 
macaques, cynomologous macaques, chimpanzees and baboons. However, there were no 
AAV sequences detected in some other species tested, including Japanese macaques, pig- 
tailed macaques and squirrel monkeys. A more extensive analysis of vector distribution was 
conducted in tissues of rhesus monkeys of the University of Pennsylvania and Tulane 
colonies recovered at necropsy. This revealed AAV sequence throughout a wide array of 
tissues. 

A. Amplification of an AAV signature region 

DNA sequences of AAV 1-6 and AAVs isolated from Goose and 
Duck were aligned to each other using "Clustal W" at default settings. The alignment for 
AAV 1-6, and including the information for the novel AAV7, is provided in Fig. 1. Sequence 
similarities among AAVs were compared. 
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In the line of study, a 257 bp region spanning 2886 bp to 3 143 bp of 
AAV 1 [SEQ ID NO: 6], and the corresponding region in the genomes of AAV 2-6 genomes 
[See, Fig. 1], was identified by the inventors. This region is located with the AAV capsid 
gene and has highly conserved sequences among at both 5' and 3' ends and is relatively 
5 variable sequence in the middle. In addition, this region contains a Dralll restriction enzyme 
site (CACCACGTC, SEQ ID NO: 15) . The inventors have found that this region serves as 
specific signature for each known type of AAV DNA. In other words, following PCR 
reactions, digestion with endonucleases that are specific to each known serotypes and gel 
electrophoresis analysis, this regions can be used to definitively identify amplified DNA as 
10 being from serotype 1, 2, 3, 4, 5, 6, or another serotype. 

The primers were designed, validated and PCR conditions optimized 
with AAV1, 2 and 5 DNA controls. The primers were based upon the sequences of AAV2: 
5' primer, IS: bp 2867-2891 of AAV2 (SEQ ID NO:7) and 3' primer, 18as, bp 3095-3121 of 
AAV2(SEQ IDNO:7). 

15 Cellular DNAs from different tissues including blood, brain, liver, 

lung, testis, etc. of different rhesus monkeys were studied utilizing the strategy described 
above. The results revealed that DNAs from different tissues of these monkeys gave rise to 
strong PCR amplifications. Further restriction analyses of PCR products indicated that they 
were amplified from AAV sequences different from any published AAV sequences. 

20 PCR products (about 255 bp in size) from DNAs of a variety of 

monkey tissues have been cloned and sequenced. Bioinformatics study of these novel AAV 
sequences indicated that they are novel AAV sequences of capsid gene and distinct from 
each other. Fig. 1 includes in the alignment the novel AAV signature regions for AAV 10- 12 
[SEQ ID NOT 17, 118 and 119, respectively]. Multiple sequence alignment analysis was 

25 performed using the Clustal W (1.81) program. The percentage of sequence identity 
between the signature regions of AAV 1-7 and AAV 10-12 genomes is provided below. 
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Table 1. Sequences for Analysis 



Sequence # 


AAV Serotype 


Size (bp) 








1 


AAV1 


258 


1 


AAV2 


255 




AAV3 


255 


4 


AAV4 


246 




AAVj 


258 


6 


AAV6 


258 


7 


AAV7 


258 


10 


AAV 10 


2S5 


11 


AAV 11 


258 


12 


AAV12 


255 



Table 3. Pairwise Alignment (Percentage of Identity) 





AAV2 


AAV3 


AAV4 


AAV5 


AAV6 


AAV7 


AAV10 


AAV11 


AAV12 


AAVl 


90 


90 


81 


76 


97 


91 


93 


94 


93 


AAV2 




93 


79 


78 


90 


90 


93 


93 


92 


AAV3 






80 


76 


90 


92 


92 


92 


92 


AAV4 








76 


81 


84 


82 


81 


79 


AAV5 










75 


78 


79 


79 


76 


AAV6 












91 


92 


94 


94 


AAV7 














94 


92 


92 


AAV10 
















95 


93 


AAV11 


















94 



10 



15 



20 



Over 300 clones containing novel AAV serotype sequences that span the 
seleeted 257 bp region were isolated and sequenced. Bioinformatics analysis of these 300+ 
clones suggests that this 257 bp region is critical in serving as a good land marker or 
signature sequence for quick isolation and identification of novel AAV serotype. 

B. Use of the signature region for PCR amplification. 

The 257 bp signature region was used as a PCR anchor to extend 
PCR amplifications to 5' of the genome to cover the junction region of rep and cap genes 
(1398 bp - 3143 bp, SEQ ID NO:6) and 3' of the genome to obtain the entire cap gene 
sequence (2866 bp - 4600 bp, SEQ ID NO:6). PCR amplifications were carried out using 
the standard conditions, including denaturing at 95°C for 0.5-1 min, annealing at 60-65°C for 
0.5-1 min and extension at 72° C for I min per kb with a total number of amplification 
cycles ranging from 28 to 42. 

Using the aligned sequences as described in "A", two other relative 
conserved regions were identified in the sequence located in 3' end of rep genes and 5' to the 
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257 bp region and in the sequence down stream of the 257 bp fragment but before the AAV 5 
3 1TR. Two sets of new primers were designed and PCR conditions optimized for recovery 
of entire capsid and a part of rep sequences of novel AAV serotypes. More specifically, for 
the 5' amplification, the 5' primer, AVINs, was GCTGCGTCAACTGGACCAATGAGAAC 
5 [nt 1398-1423 of AAV1, SEQ IDNO:6] and the 3' primer was 18as, identified above. For 
the 3' amplification, the 5' primer was Is, identified above, and the 3' primer was AV2Las, 
TCGTTTCAGTTGAACTTTGGTCTCTGCG [nt 4435-4462 of AAV2, SEQ ID NO:7]. 

In these PCR amplifications, the 257 bp region was used as a PCR 
anchor and land marker to generate overlapping fragments to construct a complete capsid 
10 gene by fusion at the Dralll site in the signature region following amplification of the 5' and 
3' extension fragments obtained as described herein. More particularly, to generate the intact 
AA V7 cap gene, the three amplification products (a) the sequences of the signature region; 
(b) the sequences of the 5' extension; and (c) the sequences of the 3' extension were cloned 
into a pCR4-Topo [Invitrogen] plasmid backbone according to manufacturer's instructions. 
15 Thereafter, the plasmids were digested with Dralll and recombined to form an intact cap 
gene. 

In this line of work, about 80 % of capsid sequences of AAV7 and 
AAV 8 were isolated and analyzed. Another novel serotype, AAV9, was also discovered 
from Monkey #2. 

?0 Using the PCR conditions described above, the remaining portion of 

the rep gene sequence for AAV7 is isolated and cloned using the primers that amplify 108 bp 
to 1461 bp of AAV genome (calculated based on the numbering of AAV2, SEQ ID NO:7). 
This clone is sequenced for construction of a complete AAV7 genome without ITRs. 
C. Direct Amplification of 3 . 1 kb Cap fragment 

25 To directly amplify a 3.1 kb full-length Cap fragment from NHP 

tissue and blood DNAs, two other highly conserved regions were identified in AAV 
genomes for use in PCR amplification of large fragments. A primer within a conserved 
region located in the middle of the rep gene was selected (AVlns: 5 1 
GCTGCGTCAACTGGACCAATGAGAAC 3', nt 1398-1423 of SEQ ID NO:6) in 

30 combination with the 3' primer located in another conserved region downstream of the Cap 
gene (AV2cas: 5' CGCAGAGACCAAAGTTCAACTGAAACGA 3', SEQ ID W):7) for 
amplification of full-length cap fragments. The PCR products were Topo-cloned according 
to manufacturer's directions (Invitrogen) and sequence analysis was performed by 
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Qiagengenomics (Qiagengenomics, Seattle, WA) with an accuracy of > 99.9%. A total of 50 
capsid clones were isolated and characterized. Among them, 37 clones were derived from 
Rhesus macaque tissues (rh.l - rh.37), 6 clones from cynomologous macaques (cy.l - cy.6), 
2 clones from Baboons (bb.l and bb.2) and 5 clones from Chimps (ch.l - ch.5). 
5 To rule out the possibility that sequence diversity within the novel AAV 

family was not an artifact of the PCR, such as PCR-mediated gene splicing by overlap 
extension between different partial DNA templates with homologous sequences, or the result 
of recombination process in bacteria, a series of experiments were performed under identical 
conditions for VP1 amplification using total cellular DNAs. First, intact AAV7 and AAY8 
10 plasmids were mixed at an equal molar ratio followed by serial dilutions. The serially diluted 
mixtures were used as templates for PCR amplification of 3.1 leb VP1 fragments using 
universal primers and identical PCR conditions to that were used for DNA amplifications to 
see whether any hybrid PCR products were generated. The mixture was transformed into 
bacteria and isolated transformants to look for hybrid clones possibly derived from 
15 recombination process in bacterial cells. In a different experiment, we restricted AAV7 and 
AAV8 plasmids with Msp I, Ava I and Hael, all of which cut both genomes multiple times at 
different positions, mixed the digestions in different combinations and used them for PCR 
amplification of VP1 fragments under the same conditions to test whether any PCR products 
could be generated through overlap sequence extension of partial AAV sequences. In another 
20 experiment, a mixture of gel purified 5' 1.5 lcb AAV7 VP1 fragment and 3' 1.7 lcb AAV8 
VP1 fragment with overlap in the signature region was serially diluted and used for PCR 
amplification in the presence and absence of 200 ng cellular DNA extracted from a monkey 
cell line that was free of AAV sequences by TaqMan analysis. None of these experiments 
demonstrated efficient PCR-mediated overlap sequence production under the conditions of 
25 the genomic DNA Cap amplification (data not shown). As a further confirmation, 3 pairs of 
primers were designed, which were located at different HVRs, and were sequence specific to 
the variants of clone 42s from Rhesus macaque F953, in different combinations to amplify 
shorter fragments from mesenteric lymph node (MLN) DNA from F953 from which clone 
42s were isolated. All sequence variations identified in full-length Cap clones were found in 
30 these short fragments (data not shown). 
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Example 2: Adeno-Associated Viruses Undergo Substantial Evolution in Primates 
During Natural Infections 

Sequence analysis of selected AAV isolates revealed divergence throughout the 
genome that is most concentrated in hypervariable regions of the capsid proteins. 

5 Epidemiologic data indicate that all known serotypes are endemic to primates, although 
isolation of clinical isolates has been restricted to AAV2 and AAV3 from anai and throat 
swabs of human infants and AAV5 from a human condylomatous wart. No known clinical 
sequalae have been associated with AAV infection. 

In an attempt to better understand the biology of AAV, nonhuman primates were 

10 used as models to characterize the sequlae of natural infections. Tissues from nonhuman 

primates were screened for AAV sequences using the PCR method of the invention based on 
oligonucleotides to highly conserved regions of known AAVs (see Example 1). A stretch of 
AAV sequence spanning 2886 to 3143 bp of AAV1 [SEQ IDNO:6] was selected as a PCR 
amplicon in which conserved sequences are flanked by a hypervariable region that is unique 

15 to each known AAV serotype, termed herein a "signature region.' 5 

An initial survey of peripheral blood of a number of nonhuman primate species 
including rhesus monkeys, cynomologous monkeys, chimpanzees, and baboons revealed 
detectable AAV in a subset of animals from all species. A more extensive analysis of vector 
distribution was conducted in tissues of rhesus monkeys of the University of Pennsylvania 

20 and Tulane colonies recovered at necropsy. This revealed AAV sequence throughout a wide 
array of tissues. 

The amplified signature sequences were subcloned into plasmids and individual 
transformants were subjected to sequence analysis. This revealed substantial variation in 
nucleotide sequence of clones derived from different animals. Variation in the signature 

25 sequence was also noted in clones obtained within individual animals. Tissues harvested 
from two animals in which unique signature sequences were identified (i.e., colon from 
98E044 and heart from 98E056) were further characterized by expanding the sequence 
amplified by PCR using oligonucleotides to highly conserved sequences. In this way, 
complete proviral structures were reconstructed for viral genomes from both tissues as 

30 described herein. These proviruses differ from the other known AAVs with the greatest 
sequence divergence noted in regions of the Cap gene. 

Additional experiments were performed to confirm that AAV sequences resident to 
the nonhuman primate tissue represented proviral genomes of infectious virus that is capable 
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of being rescued and form virions. Genomic DNA from liver tissue of animal 98E056, from 
which AAV8 signature sequence was detected, was digested with an endonuclease that does 
not have a site within the AAV sequence and transfected into 293 cells with a plasmid 
containing an El deleted genome of human adenovirus serotype 5 as a source of helper 

5 functions. The resulting lysate was passaged on 293 cells once and the lysate was recovered 
and analyzed for the presence of AAV Cap proteins using a broadly reacting polyclonal 
antibody to Cap proteins and for the presence and abundance of DNA sequences from the 
PCR amplified AAV provirus from which AAV8 was derived. Transfection of endonuclease 
restricted heart DNA and the adenovirus helper plasmid yielded high quantities of AAV8 

10 virus as demonstrated by the detection of Cap proteins by Western blot analysis and the 
presence of 10 4 AAV8 vector genomes per 293 cell. Lysates were generated from a large- 
scale preparation and the AAV was purified by cesium sedimentation. The purified 
preparation demonstrated 26 nm icosohedral structures that look identical to those of AAV 
serotype 2. Transfection with the adenovirus helper alone did not yield AAV proteins or 

15 genomes, ruling out contamination as a source of the rescued AAV. 

To further characterize the inter and intra animal variation of AAV signature 
sequence, selected tissues were subjected to extended PCR to amplify entire Cap open 
reading frames. 

The resulting fragments were cloned into bacterial plasmids and individual 
20 transformants were isolated and fully sequenced. This analysis involved mesenteric lymph 
nodes from three rhesus monkeys (Tulane/V223 - 6 clones; Tulane/T612 - 7 clones; 
Tulane/F953 - 14 clones), liver from two rhesus monkeys (Tulane/V251 - 3 clones; 
Penn/00E033 - 3 clones), spleen from one rhesus monkey (Penn/97E043 ~ 3 clones), heart 
from one rhesus monkey (IHGT/98E046- 1 clone) and peripheral blood from one 
25 chimpanzee (New Iberia/X133 - 5 clones), six cynomologous macaques (Charles 

River/A1378, A3099, A3388, A3442, A2821, A3242-6 clones total) and one Baboon 
(SFRB/8644 - 2 clones). Of the 50 clones that were sequenced from 15 different animals, 30 
were considered non-redundant based on the finding of at least 7 amino acid differences 
from one another. The non-redundant VP I clones are numbered sequentially as they were 
30 isolated, with a prefix indicating the species of non-human primate from which they were 
derived. The structural relationships between these 30 non-redundant clones and the 
previously described 8 AAV serotypes were determined using the SplitsTree program 
[Huson, D. H. SplitsTree: analyzing and visualizing evolutionary data. Bioinfonnatics 14, 
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68-73 (1998)] with implementation of the method of split decomposition. The analysis 
depicts homoplasy between a set of sequences in a tree-like network rather than a bifurcating 
tree. The advantage is to enable detection of groupings that are the result of convergence and 
to exhibit phylogenetic relationships even when they are distorted by parallel events. 
5 Extensive phylogenetic research will be required in order to elucidate the AAV evolution, 
whereas the intention here only is to group the different clones as to their sequence 
similarity. 

To confirm that the novel VP1 sequences were derived from infectious viral 
genomes, cellular DNA from tissues with high abundance of viral DNA was restricted with 

10 an endonuclease that should not cleave within AAV and transfected into 293 ceils, followed 
by infection with adenovirus. This resulted in rescue and amplification of AAV genomes 
from DNA of tissues from two different animals (data not shown). 

VP1 sequences of the novel AAVs were further characterized with respect to the 
nature and location of amino acid sequence variation. All 30 VP1 clones that were shown to 

15 differ from one another by greater than 1% amino acid sequence were aligned and scored for 
variation at each residue. An algorithm developed to determine areas of sequence 
divergence yielded 12 hypervariable regions (HVR) of which 5 overlap or are part of the 4 
previously described variable regions [Kotin, cited above; Rutledge, cited above]. The three- 
fold-proximal peaks contain most of the variability (HVR5-10). Interestingly the loops 

20 located at the 2 and 5 fold axis show intense variation as well. The HVRs I and 2 occur in 
the N-terminal portion of the capsid protein that is not resolved in the X-ray structure 
suggesting that the N-terminus of the VP1 protein is exposed on the surface of the virion. 

Real-time PCR was used to quantify AAV sequences from tissues of 21 rhesus 
monkeys using primers and probes to highly conserved regions of Rep (one set) and Cap 

25 (two sets) of known AAVs. Each data point represents analysis from tissue DNA from an 
individual animal. This confirmed the wide distribution of AAV sequences, although the 
quantitative distribution differed between individual animals. The source of animals and 
previous history or treatments did not appear to influence distribution of AAV sequences in 
rhesus macaques. The three different sets of primers and probes used to quantify AAV 

30 yielded consistent results. The highest levels of AAV were found consistently in mesenteric 
lymph nodes at an average of 0.01 copies per diploid genome for 13 animals that were 
positive. Liver and spleen also contained high abundance of virus DNA. There were 
examples of very high AAV, such as in heart of rhesus macaque 98E056, spleen of rhesus 
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macaque 97E043 and liver of rhesus macaque RQ4407, which demonstrated 1.5, 3 and 20 
copies of AAV sequence per diploid genome respectively. Relatively low levels of virus 
DNA were noted in peripheral blood mononuclear cells, suggesting the data in tissue are not 
due to resident blood components (data not shown). It should be noted that this method 
5 would not necessarily capture all AAVs resident to the nonhuman primates since detection 
requires high homology to both the oligonucleotides and the real time PCR probe. Tissues 
from animals with high abundance AAV DNA was further analyzed for the molecular state 
of the DNA, by DNA hybridization techniques, and its cellular distribution, by in situ 
hybridization. 

10 The kind of sequence variation revealed in AAV proviral fragments isolated from 

different animals and within tissues of the same animals is reminiscent of the evolution that 
occurs for many RNA viruses during pandemics or even within the infection of an 
individual. In some situations the notion of a wild-type virus has been replaced by the 
existence of swarms of quasispecies that evolve as a result of rapid replication and mutations 
1 5 in the presence of selective pressure. One example is infection by HIV, which evolves in 

response to immunologic and pharmacologic pressure. Several mechanisms contribute to the 
high rate of mutations in RNA viruses, including low fidelity and lack of proof reading 
capacity of reverse transcriptase and non-homologous and homologous recombination. 

Evidence for the formation of quasispecies of AAV was illustrated in this study by 
20 the systematic sequencing of multiple cloned proviral fragments. In fact, identical sequences 
could not be found within any extended clones isolated between or within animals. An 
important mechanism for this evolution of sequence appears to be a high rate of homologous 
recombination between a more limited number of parenteral viruses. The net result is 
extensive swapping of hypervariable regions of the Cap protein leading to an array of 
25 chimeras that could have different tropisms and serologic specificities (i.e., the ability to 
escape immunologic responses especially as it relates to neutralizing antibodies). 
Mechanisms by which homologous recombination could occur are unclear. One possibility 
is that + and - strands of different single stranded AAV genomes anneal during replication as 
has been described during high multiplicity of infections with AAV recombinants. It is 
30 unclear if other mechanisms contribute to sequence evolution in AAV infections. The 

overall rate of mutation that occurs during AAV replication appears to be relatively low and 
the data do not suggest high frequencies of replication errors. However, substantial 
rearrangements of the AAV genome have been described during lytic infection leading to the 
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formation of defective interfering particles. Irrespective of the mechanisms that lead to 
sequence divergence, with few exceptions , vpl structures of the quasispecies remained 
intact without frameshifts or nonsense mutations suggesting that competitive selection of 
viruses with the most favorable profile of fitness contribute to the population dynamics. 

5 These studies have implications in several areas of biology and medicine. The 

concept of rapid virus evolution, formerly thought to be a property restricted to RNA viruses, 
should be considered in DNA viruses, which classically have been characterized by serologic 
assays. It will be important in terms of parvoviruses to develop a new method for describing 
virus isolates that captures the complexity of its structure and biology, such as with HIV, 

10 which are categorized as general families of similar structure and function called Clades. An 
alternative strategy is to continue to categorize isolates with respect to serologic specificity 
and develop criteria for describing variants within serologic groups. 



Example 3: Vectorology of recombinant AAV genomes equipped with AAV2 ITRs 
15 using chimeric plasmids containing AAV2 rep and novel AAV cap genes for serological 

and gene transfer studies in different animal models. 

Chimeric packaging constructs are generated by fusing AAV2 rep with cap 

sequences of novel AAV serotypes. These chimeric packaging constructs are used, initially, 

for pseudotyping recombinant AAV genomes carrying AAV2 ITRs by triple transfection in 
20 293 cell using Ad5 helper plasmid. These pseudotyped vectors are used to evaluate 

performance in transduction-based serological studies and evaluate gene transfer efficiency 

of novel AAV serotypes in different animal models including NHP and rodents, before intact 

and infectious viruses of these novel serotypes are isolated. 

A. pAAV2GFP 

25 The AAV2 plasmid which contains the AAV2 ITRs and green fluorescent 

protein expressed under the control of a constitutive promoter. This plasmid contains the 
following elements: the AAV2 ITRs, a CMV promoter, and the GFP coding sequences. 

B. Cloning of trans plasmid 

To construct the chimeric trans-plasmid for production of recombinant 
30 pseudotyped AAV7 vectors, p5E18 plasmid (Xiao et al, 1999, J. Virol 73:3994-4003) was 
partially digested with Xho I to linearize the plasmid at the Xho I site at the position of 3169 
bp only. The Xho 1 cut ends were then filled in and ligated back. This modified p5E18 
plasmid was restricted with Xba I and Xho I in a complete digestion to remove the AAV2 
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cap gene sequence and replaced with a 2267 bp Spe I/Xho I fragment containing the AAV7 
cap gene which was isolated from pCRAAV7 6-5+15-4 plasmid. 

The resulting plasmid contains the AA V2 rep sequences for Rep78/68 under the 
control of the AA V2 P5 promoter, and the AAV2 rep sequences for Rep52/40 under the control 
of the AAV2 PI 9 promoter. The AAV7 capsid sequences are under the control of the AAV2 
P40' promoter, which is located within the Rep sequences. This plasmid further contains a 
spacer 5' of the rep ORR 

C. Production of Pseudotyped rAA V 

The rAAV particles (AAV2 vector in AAV7 capsid) are generated using 
an adenovirus-free method. Briefly, the cis plasmid (pAAV2.1 lacZ plasmid containing 
AAV2 ITRs), and the trans plasmid pCRAAV7 6-5+15-4 (containing the AAV2 rep and 
AAV7 cap) and a helper plasmid, respectively, were simultaneously co-transfected into 293 
cells in a ratio of 1:1:2 by calcium phosphate precipitation. 

For the construction of the pAd helper plasmids, pBGl 0 plasmid was 
purchased from Microbix (Canada). A RsrII fragment containing L2 and L3 was deleted 
frompBHGlO, resulting in the first helper plasmid, pAdAF13. Plasmid AdA Fl was 
constructed by cloning Asp700/Sall fragment with a Pmel/Sgfl deletion, isolating from 
pBHGlO, into Bluescript. MLP, L2, L2 and L3 were deleted in the pAdAFl. Further 
deletions of a 2.3 kb Nrul fragment and, subsequently, a 0.5 kb RsrII/NruI fragment 
generated helper plasmids pAdAFS and pAdAF6, respectively. The helper plasmid, termed 
pAF6, provides the essential helper functions of E2a and E4 ORF6 not provided by the El- 
expressing helper cell, but is deleted of adenoviral capsid proteins and functional El 
regions). 

Typically, 50 fig of DNA (cis:trans:helper) was transfected onto a 150 mm 
tissue culture dish. The 293 cells were harvested 72 hours post-transfection, sonicated and 
treated with 0.5% sodium deoxycholate (37°C for 10 min.) Cell lysates were then subjected to 
two rounds of a CsCl gradient. Peak fractions containing rAAV vector are collected, pooled and 
dialyzed against PBS. 
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Example 4: Creation of infectious clones carrying intact novel AAV serotypes for study 
of basic virology in human and NHP derived cell lines and evaluation of pathogenesis of 
novel AAV serotypes in NHP and other animal models. 

To achieve this goal, the genome walker system is employed to obtain 5' and 

5 3' terminal sequences (ITRs) and complete construction of clones containing intact novel 
AAV serotype genomes. 

Briefly, utilizing a commercially available Universal Genome Walker Kit 
[Clontech], genomic DNAs from monkey tissues or cell lines that are identified as positive 
for the presence of AAV7 sequence are digested with Dra I, EcoR V, Pvu II and Stu I 

10 endonucleases and ligated to Genome Walker Adaptor to generate 4 individual Genome 
Walker Libraries (GWLs). Using DNAs from GWLs as templates, AAV7 and adjacent 
genomic sequences will be PCR-amplified by the adaptor primer 1 (API, provided in the kit) 
and an AAV7 specific primer 1, followed by a nested PGR using the adaptor primer 2 (AP2) 
and another AAV7 specific primer 2, both of which are internal to the first set of primers. 

15 The major PCR products from the nested PCR are cloned and characterized by sequencing 
analysis. 

. In this experiment, the primers covering the 257 bp or other signature 
fragment of a generic AAV genome are used for PCR amplification of cellular DNAs 
extracted from Human and NHP derived cell lines to identify and characterize latent AAV 

20 sequences. The identified latent AAV genomes are rescued from the positive cell lines using 
adenovirus helpers of different species and strains. 

To isolate infectious AAV clones from NHP derived cell lines, a desired cell 
line is obtained from ATCC and screened by PCR to identify the 257 bp amplicom i.e., 
signature region of the invention. The 257 bp PCR product is cloned and serotyped by 

25 sequencing analysis. For these cell lines containing the AAV7 sequence, the cells are 

infected with SV-15, a simian adenovirus purchased from ATCC, human Ad5 or transfected 
with plasmid construct housing the human Ad genes that are responsible for AAV helper 
functions. At 48 hour post infection or transfection, the cells are harvested and Hirt DNA is 
prepared for cloning of AAV7 genome following Xiao et ah, 1999, J. Virol, 73:3994-4003. 

30 

Example 5 - Production of AAV Vectors 

A pseudotyping strategy similar to that of Example 3 for AAV1/7 was employed to 
produce AAV2 vectors packaged with AAV1, AAV5 and AAV8 capsid proteins. Briefly, 
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recombinant AAV genomes equipped with AAV2 ITRs were packaged by triple transfection 
of 293 cells with cis-plasmid, adenovirus helper plasmid and a chimeric packaging construct 
where the AAV2 rep gene is fused with cap genes of novel AAV serotypes. To create the 
chimeric packaging constructs, the Xho I site of p5El 8 plasmid at 3169 bp was ablated and 

5 the modified plasmid was restricted with Xba I and Xho I in a complete digestion to remove 
the AAV2 cap gene and replace it with a 2267 bp Spe I/Xho I fragment containing the 
AAV8 cap gene [Xiao, W., et al., (1999) J Virol 73, 3994-4003]. A similar cloning strategy 
was used for creation of chimeric packaging plasmids of AAV2/1 and AAV2/5. All 
recombinant vectors were purified by the standard CsCl2 sedimentation method except for 

10 AAV2/2, which was purified by single step heparin chromatography. 

Genome copy (GC) titers of AAV vectors were determined by TaqMan analysis 
using probes and primers targeting SV40 poly A region as described previously [Gao, G., et 
ah, (2000) Hum Gene Ther 11, 2079-91]. 

• Vectors were constructed for each serotype for a number of in vitro and in vivo 

1 5 studies. Eight different transgene cassettes were incorporated into the vectors and 

recombinant virions were produced for each serotype. The recovery of virus, based on 
genome copies, is summarized in Table 4 below. The yields of vector were high for each 
serotype with no consistent differences between serotypes. Data presented in the table are 
average genome copy yields with standard deviation x 10 13 of multiple production lots of 50 

20 plate (1 50 mm ) transfections. 



Table 4. Production of Recombinant Vectors 





AAV2/1 


AAV2/2 


AAV2/5 


AAV2/7 


AAV2/8 


CMV 


7.30 + 4.33 


4.49 + 2.89 


5.19 + 5.19 


3.42 


0.87 


LacZ 


(n=9) 


(n«6) 


(n=8) 


(n=l) 


(n=l) 


CMV 


6.43 + 2.42 


3.39 + 2.42 


5.55 + 6.49 


2.98 + 2.66 


3.74 + 3.88 


EGFP 


(n=2) 


(n=2) 


(n=4) 


(n=2) 


(n=2) 


TBG LacZ 


4.18 


0.23 


0.704 + 0.43 


2.16 


0.532 




(n*l) 


(n=l) 


(n=2) 


(n=l) 


(n=l) 


Alb Al AT 


4.67 + 0.75 


4.77 


4.09 


5.04 


2.02 




(n=2) 


(n=l) 


(n=D 


(n=l) 


(n=l) 


CB A1AT 


0.567 


0.438 


2.82 


2.78 


0.816 + 




(n=l) 


(n=l) 


(n=l) 


(n=l) 


0.679 (n=2) 
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AAV2/1 


AAV2/2 


AAV2/5 


AAV2/7 


AAV2/S 


TBG 
r\\CG 


8.51 +6.65 
(n=6) 


3.47 + 2.09 
(n=5) 


5.26 + 3.S5 
(n=4) 


6.52 + 3.08 
(n=4) 


1.83+0.98 
(n=5) 


TBG cFIX 


1.24+ 1.29 
(n=3) 


0.63 + 0.394 
(n=6) 


3.74 + 2.48 
(n=7) 


4.05 
(n=l) 


15.8+ 15.0 
(n=5) 



Example 6 - Serologic Analysis of Pseudotyped Vectors 

C57BL/6 mice were injected with vectors of different serotypes of AAVCBA1 AT 
vectors intramuscularly (5 x 10 n GC) and serum samples were collected 34 days later. To 
test neutralizing and cross-neutralizing activity of sera to each serotype of AAV, sera was 
analyzed in a transduction based neutralizing antibody assay [Gao, G. P., et al., Q 996) J 
Virol 70, 8934-43]. More specifically, the presence of neutralizing antibodies was 
determined by assessing the ability of serum to inhibit transduction of 84-31 cells by reporter 
viruses (AAVCMVEGFP) of different serotypes. Specifically, the reporter virus 
AAVCMVEGFP of each serotype [at multiplicity of infection (MOI) that led to a 
transduction of 90% of indicator cells] was pre- incubated with heat-inactivated serum from 
animals that received different serotypes of AAV or from naive mice. After 1-hour 
incubation at 37° C, viruses were added to 84-31 ceils in 96 well plates for 4S or 72- hour, 
depending on the virus serotype. Expression of GFP was measured by Fluorolmagin 
(Molecular Dynamics) and quantified by Image Quant Software. Neutralizing antibody titers 
were reported as the highest serum dilution that inhibited transduction to less than 50%. 

The availability of GFP expressing vectors simplified the development of an assay 
for neutralizing antibodies that was based on inhibition of transduction in a permissive ceil 
line (i.e., 293 cells stably expressing E4 from Ad5). Sera to selected AAV serotypes were 
generated by intramuscular injection of the recombinant viruses. Neutralization of AA V 
transduction by 1 :20 and 1 :80 dilutions of the antisera was evaluated (See Table 5 below). 
Antisera to AAV1, AAV2, AAV5 and AAV8 neutralized transduction of the serotype to 
which the antiserum was generated (AAV5 and AAV8 to a lesser extent than AAV1 and 
AAV2) but not to the other serotype (i.e., there was no evidence of cross neutralization 
suggesting that AAV 8 is a truly unique serotype). 
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Table 5. Serological Analysis of New AAV Serotypes. 



5 







% 


nfection on 84-31 cells with AAVCMVEGFP virus: 






AAV2/1 


AAV2/2 


AAV2/5 


AAV2/7 


AAV2/8 






Serum dilution: 


Serum dilution: 


Serum dilution: 


Serum dilution: 


Serum dilution: 


Sera: 


Immunization Vector 


7/20 


1/80 


1/20 


1/80 


1/20 


1/80 


1/20 


1/80 


1/20 


1/80 


Group 1 


AAV2/1 


0 


0 


100 


100 


100 


100 


100 


100 


100 


100 


Group 2 


AAV2/2 


100 


100 


0 


0 


100 


100 


100 


100 


100 


100 


Group 3 


AAV2/5 


100 


100 


100 


100 


16.5 


16.5 


100 


100 


100 


100 


Group 4 


AAV2/7 


100 


100 


100 


100 


100 


100 


61.5 


100 


100 


100 


Group 5 


AAV2/8 


100 


100 


100 


100 


100 


100 


100 


100 


26.3 


60 



10 Human sera from 52 normal subjects were screened for neutralization against 

selected serotypes. No serum sample was found to neutralize AAV2/7 and AAV2/8 while 
AAV2/2 and AAV2/1 vectors were neutralized in 20% and 10% of sera, respectively. A 
fraction of human pooled IgG representing a collection of 60,000 individual samples did not 
neutralize AAV2/7 and AAV2/8, whereas AAV2/2 and AAV2/1 vectors were neutralized at 

] 5 titers of serum equal to 1/1280 and 1/640, respectively. 

Example 7 - In vivo Evaluation of Different Serotypes of AAV Vectors 

In this study, 7 recombinant AAV genomes, AAV2CBhAl AT, AAV2AlbhAl AT, 
AAV2CMVrhCG, AAV2TBGrhCG, AAV2TBGcFIX, AAV2CMVLacZ and 

20 AAV2TBGLacZ were packaged with capsid proteins of different serotypes. In all 7 
constructs, minigene cassettes were flanked with AAV2 ITRs. cDNAs of human a- 
antitrypsin (Al AT) [Xiao, W., et al., (1999) J Virol 73, 3994-4003] p-subunit of rhesus 
monkey choriogonadotropic hormone (CG) [Zoltick, P. W. & Wilson, J. M. (2000; Mol Ther 
2, 657-9] canine factor IX [Wang, L., et al., (1997) Proc Natl Acad Sci US A 94,1] 563-6] 

25 and bacterial p-glactosidase (i.e., Lac Z) genes were used as reporter genes. For liver- 
directed gene transfer, either mouse albumin gene promoter (Alb) [Xiao, W. (1999), cited 
above] or human thyroid hormone binding globulin gene promoter (TBG) [Wang (1997), 
cited above] was used to drive liver specific expression of reporter genes. In muscle-directed 
gene transfer experiments, either cytomegalovirus early promoter (CMV) or chicken p-actin 

30 promoter with CMV enhancer (CB) was employed to direct expression of reporters. 

For muscle-directed gene transfer, vectors were injected into the right tibialis anterior 
of 4-6 week old NCR nude or C57BL/6 mice (Taconic, Germantown, NY). In liver-directed 
gene transfer studies, vectors were infused intraportally into 7-9 week old NCR nude or 
C57BL/6 mice (Taconic, Germantown, NY). Serum samples were collected intraorbital ly at 
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different time points after vector administration. Muscle and liver tissues were harvested at 
different time points for cryosectioning and Xgal histochemical staining from animals that 
received the lacZ vectors. For the re-administration experiment, C56BL/6 mice initially 
received AAV2/1, 2/2, 2/5, 2/7 and 2/8CBA1 AT vectors intramuscularly and followed for 

5 Al AT gene expression for 7 weeks. Animals were then treated with AAV2/8TBGcFIX 
intraportally and studied for cFIX gene expression. 

ELISA based assays were performed to quantify serum levels of hAlAT, rhCG and 
cFIX proteins as described previously [Gao, G. P., et al., (1996) J Virol 70, 8934-43; Zolticlc, 
P. W. & Wilson, J. M. (2000) Mol Ther 2, 657-9; Wang, L., et al, Proc Natl Acad Sal USA 

10 94, 1 1563-6]. The experiments were completed when animals were sacrificed for harvest of 
muscle and liver tissues for DNA extraction and quantitative analysis of genome copies of 
vectors present in target tissues by TaqMan using the same set of primers and probe as in 
titration of vector preparations [Zhang, Y, et a!, (2001) Mol Ther 3, 697-707]. 

The performance of vectors base on the new serotypes were evaluated in murine 

15 models of muscle and liver-directed gene transfer and compared to vectors based on the 
known serotypes AAV1, AAV2 and AAV5, Vectors expressing secreted proteins (alpha- 
antitrypsin (Al AT) and chorionic gonadotropin (CG)) were used to quantitate relative 
transduction efficiencies between different serotypes through ELISA analysis of sera. The 
cellular distribution of transduction within the target organ was evaluated using lacZ 

20 expressing vectors and X-gal histochemistry . 

The performance of AAV vectors in skeletal muscle was analyzed following direct 
injection into the tibialis anterior muscles. Vectors contained the same AAV2 based genome 
with the immediate early gene of CMV or a CMV enhanced (3-actin promoter driving 
expression of the transgene. Previous studies indicated that immune competent C57BL/6 

25 mice elicit limited humoral responses to the human Al AT protein when expressed from 
AAV vectors [Xiao, W., et al, (1999)/ Virol 73, 3994-4003]. 

In each strain, AAV2/1 vector produced the highest levels of Al AT and AAV2/2 
vector the lowest, with AAV2/7 and AAV2/8 vectors showing intermediate levels of 
expression. Peak levels of CG at 28 days following injection of nu/nu NCR mice showed the 

30 highest levels from AAV2/7 and the lowest from AAV2/2 with AAV2/8 and AAV2/1 in 
between. Injection of AAV2/1 and AAV2/7 lacZ vectors yielded gene expression at the 
injection sites in all muscle fibers with substantially fewer lacZ positive fibers observed with 
A A V2/2 and AAV 2/8 vectors. These data indicate that the efficiency of transduction with 
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AAV2/7 vectors in skeletal muscle is similar to that obtained with AAV2/1, which is the 
most efficient in skeletal muscle of the previously described serotypes [Xiao, W. (1999), 
cited above; Chao, H. 5 et ah, (2001) Mol Ther 4, 217-22; Chao, H, et al. 3 (2000) Mol Ther 2, 
619-23]. 

5 Similar murine models were used to evaluate liver-directed gene transfer. Identical 

doses of vector based on genome copies were infused into the portal veins of mice that were 
• analyzed subsequently for expression of the transgene. Each vector contained an AA V2 
based genome using previously described liver-specific promoters (i.e., albumin or thyroid 
hormone binding globulin) to drive expression of the transgene. More particularly, CMVCG 

10 and TBGCG minigene cassettes were used for muscle and liver-directed gene transfer, 

respectively. Levels of rhCG were defined as relative units (RUs x 10 3 ). The data were from 
assaying serum samples collected at day 28, post vector administration (4 animals per 
group). As shown in Table 3, the impact of capsid proteins on the efficiency of transduction 
of Al AT vectors in nu/nu and C57BL/6 mice and CG vectors in C57BL/6 mice was 

1 5 consistent (See Table 6). 

Tabic 6. Expression of P-unit of Rhesus Monkey Chorionic Gonadotropin (rhCG) 



Vector 


Muscle 


Liver 


AAV2/1 


4.5 ± 2.1 


1.6 + 1.0 


AAV2 


0.5 ± 0.1 


0.7 ± 0.3 


AAV2/5 


ND* 


4.8 ± 0.8 


AAV2/7 


14.2 ± 2.4 


8.2 ± 4.3 


AAV2/8 


4.0 ± 0.7 


76.0 ± 22.8 



* Not determined in this experiment. 

25 

In all cases, AAV2/8 vectors yielded the highest levels of transgene expression that 
ranged from 16 to 1 10 greater than what was obtained with AAV2/2 vectors; expression 
from AAV2/5 and AAV2/7 vectors was intermediate with AAV2/7 higher than AAV2/5. 
Analysis of X-Gal stained liver sections of animals that received the corresponding lacZ 
30 vectors showed a correlation between the number of transduced cells and overall levels of 
transgene expression. DNAs extracted from livers of C57BL/6 mice who received the A 1 AT 
vectors were analyzed for abundance of vector DNA using real time PCR technology. 
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The amount of vector DNA found in liver 56 days after injection correlated with 
the levels of transgene expression (See Table 7). For this experiment, a set of probe and 
primers targeting the SV40 polyA region of the vector genome was used for TaqMan 
PCR. Values shown are means of three individual animals with standard deviations. The 
5 animals were sacrificed at day 56 to harvest liver tissues for DNA extraction. These 
studies indicate that AAV8 is the most efficient vector for liver-directed gene transfer 
due to increased numbers of transduced hepatocytes. 

Table 7 - Real Time PCR Analysis for Abundance of AAV Vectors in nu/nu Mouse 
10 Liver Following Injection of lxlO 11 Genome Copies of Vector. 

AAV vectors/Dose Genome Copies per Cell 

AAV2/lAlbAlAT 0.6 ± 036 

15 

AAV2AlbAl AT 0.003 ± 0.001 

AAV2/5AlbAlAT 0.83 ± 0.64 

20 AAV2/7AlbAlAT 2.2 ± 1.7 

AAV2/8AlbAlAT 18 ±11 



25 The serologic data described above suggest that AAV2/8 vector should not be 

neutralized in vivo following immunization with the other serotypes. C57BL/6 mice 
received intraportal injections of AAV2/8 vector expressing canine factor IX (10 n genome 
copies) 56 days after they received intramuscular injections of A1AT vectors of different 
serotypes. High levels of factor IX expression were obtained 14 days following infusion of 

30 AAV2/8 into naive animals (17+2 jag/ml, n=4) which were not significantly different that 
what was observed in animals immunized with AAV2/1 (31+23 fig/ml, n=4) 3 AAV2/2 (16 
jig/ml, n=2), and AAV2/7 (12 [ig/ml 5 n=2). This contrasts to what was observed in AAV2/8 
immunized animals that were infused with the AAV2/8 factor IX vector in which no 
detectable factor IX was observed (< 0.1 ng/ml, n=4). 

35 Oligonucleotides to conserved regions of the cap gene did amplify sequences from 

rhesus monkeys that represented unique AAVs. Identical cap signature sequences were 
found in multiple tissues from rhesus monkeys derived from at least two.different colonies. 
Full-length rep and cap open reading frames were isolated and sequenced from single 
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sources. Only the cap open reading frames of the novel AAVs were necessary to evaluate 
their potential as vectors because vectors with the AAV7 or AAVS capsids were generated 
using the ITRs and rep from AAV2. This also simplified the comparison of different vectors 
since the actual vector genome is identical between different vector serotypes. In fact, the 
5 yields of recombinant vectors generated using this approach did not differ between 
serotypes. 

Vectors based on AAV7 and AAVS appear to be immunologically distinct (i.e., they 
are not neutralized by antibodies generated against other serotypes). Furthermore, sera from 
humans do not neutralize transduction by AAV7 and AAVS vectors, which is a substantial 

1 0 advantage over the human derived AAVs currently under development for which a 

significant proportion of the human population has pre-existing immunity that is neutralizing 
[Chirmule, N., etal., (1999) Gene Ther 6, 1574-83]. 

The tropism of each new vector is favorable for in vivo applications. 
AAV2/7 vectors appear to transduce skeletal muscle as efficiently as AAV2/1, which is the 

1 5 serotype that confers the highest level of transduction in skeletal muscle of the primate 
AAVs tested to date [Xiao, W, cited above; Chou (2001), cited above, and Chou (2000), 
cited above]. Importantly, AAV2/8 provides a substantial advantage over the other serotypes 
in terms of efficiency of gene transfer to liver that until now has been relatively 
disappointing in terms of the numbers of hepatocytes stably transduced. AAV2/8 

20 consistently achieved a 10 to 100-fold improvement in gene transfer efficiency as compared 
to the other vectors. The basis for the improved efficiency of AAV2/8 is unclear, although it 
presumably is due to uptake via a different receptor that is more active on the basolateral 
surface of hepatocytes. This improved efficiency will be quite useful in the development of 
liver-directed gene transfer where the number of transduced cells is critical, such as in urea 

25 cycle disorders and familial hypercholesterolemia. 

Thus, the present invention provides a novel approach for isolating new AAVs based 
on PCR retrieval of genomic sequences. The amplified sequences were easily incorporated 
into vectors and tested in animals. The lack of pre-existing immunity to AAV7 and the 
favorable tropism of the vectors for muscle indicates that AAV7 is suitable for use as a 

30 vector in human gene therapy and other in vivo applications. Similarly, the lack of pre- 
existing immunity to the AAV serotypes of the invention, and their tropisms, renders them 
useful in delivery of therapeutic molecules and other useful molecules. 
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Example 9 - Tissue Tropism Studies 

In the design of a high throughput functional screening scheme for novel AAV 
constructs, a non-tissue specific and highly active promoter, CB promoter (CMV enhanced 
chicken p actin promoter) was selected to drive an easily detectable and quantifiable reporter 

5 gene, human a anti-trypsin gene. Thus only one vector for each new AAV clone needs to be 
made for gene transfer studies targeting 3 different tissues, liver, lung and muscle to screen 
for tissue tropism of a particular AAV construct. The following table summarizes data 
generated from 4 novel AAV vectors in the tissue tropism studies (AAVCBA1 AT), from 
which a novel AAV capsid clone, 44.2, was found to be a very potent gene transfer vehicle 

10 in all 3 tissues with a big lead in the lung tissue particularly. Table 8 reports data obtained 
(in ^ig AlAT/mL serum) at day 14 of the study. 



Table 8 



Vector 


Target Tissue 




Lung 


Liver 


Muscle 


AAV2/1 


ND 


ND 


45±11 


AAV2/5 


0.6±0.2 


ND 


ND 


AAV2/8 


ND 


84±30 


ND 


AAV2/ih.2 (43.1) 


14±7 


25±7.4 


35±14 


AAV2/rh.l0 (44.2) 


23±6 


53±19 


46±11 


AAV2/rh.l3 (42.2) 


3.5±2 


2±0.8 


3.5±1.7 


AAV2/rh.21 (42.10) 


3.1±2 


2±1.4 


4.3±2 



15 A couple of other experiments were then performed to confirm the superior tropism of AAV 
44.2 in lung tissue. First, AAV vector carried CClOhAl AT minigene for lung specific 
expression were pseudotyped with capsids of novel AAVs were given to Immune deficient 
animals (NCR nude) in equal volume (50 \il each of the original preps without dilution) via 
intratracheal injections as provided in the following table. In Table 9, 50 p.1 of each original 

20 prep per mouse, NCR Nude, detection limit >0.033 ng/ml, Day 28 
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Table 9 



Vector 


Total GC 


Hg of AlAT/ml 


ug of AlAT/ml 


Relative Gene 




in 


with 50jil vector 


with 1x10" 


transfer as 




50 ul vector 




vector 


compared to 
rli.10 (clone 
44.2) 


2/1 


3xl0 12 


2.6+0.5 


0.09±0.02 


2.2 


2/2 


5.5x10" 


<0.03 


<0.005 


<0.1 


2/5 


3.6xl0 12 


0.65+0.16 


0.02+0.004 


0.5 


2/7 


4.2xl0 12 • 


1+0.53 


0.02+0.01 


0.5 


2/8 


7.5x10" 


0.9+0.7 


0.12+0.09 


2.9 


2/ch.5 (A.3.1) 


9xl0 12 


1+0.7 


0.0HO.008 


0.24 


2/rh.S (43.25) 


4.6xl0 12 


26+21 


0.56+0.46 


13.7 


2/rh.lO (44.2) 


2.8xl0 12 


115+38 


4.1+1.4 


100 


2/rh.l3 (42.2) 


6xl0 12 


7.3+0.8 


0.12+0.01 


2.9 


2/rh.21 (42.10) 


2.4xl0 12 


9±0.9 


0.38+0.04 


9.3 


2/rh.22 (42.11) 


2.6xl0 12 


6±0.4 


0.23+0.02 


5.6 


2/rh.24 (42.13) 


1.1x10" 


0.4+0.3 


0.4+0.3 


1 



The vectors were also administered to immune competent animals (C57BL/6) in equal 
5 genome copies (lxlO 1 1 GC) as shown in the Table 10. (1x10" GC per animal, C57BL/6, day 
1 4, detection limit >0.033 ug/ml) 



Table 10 



AAV Vector 


ug of AlAT/ml 


Relative Gene transfer as 




with 1x10" vector 


compared to rh.10 (clone 
44.2) 


2/1 


0.076+0.031 


2.6 


2/2 


0.1+0.09 


3.4 


2/5 


0.0840.033 


2.9 


2/7 


0.33±0.01 


11 


2/8 


1.92+1.3 


2.9 


2/ch.5 (A.3.1) 


0.048+0.004 


1.6 


2/rh.8 (43.25) 


1.7+0.7 


58 


2/rh.l 0(44.2) 


2.93+1.7 


100 


2/rh.l3 (42.2) 


0.45+0.15 


15 


2/rh.21 (42.10) 


0.86+0.32 


29 


2/rh.22 (42.11) 


0.38+0.18 


13 


2/rh.24 (42.13) 


0.3+0.19 


10 



10 The data from both experiments confirmed the superb tropism of clone 44.2 in lung- 

directed gene transfer. 
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Interestingly, performance of clone 44.2 in liver and muscle directed gene transfer 
was also outstanding, close to that of the best liver transducer, AAV8 and the best muscle 
transducer AAV1, suggesting that this novel AAV has some intriguing biological 
significance. 

To study serological properties of those novel AAVs, pseudotyped AAVGFP vectors 
were created for immunization of rabbits and in vitro transduction of 84-3 1 cells in the 
presence and absence of antisera against different capsids. The data are summarized below: 

Table 11a. Cross-NAB assay in 8431 cells and adenovirus (Adv) coinfection 
Infection in 8431 cells (coinfected with Adv) with: 



Serum 
from rabbit 
immunized with: 


10 9 GC 


10 9 GC 


10 9 GC 


10 !0 GC 


rh.13 


rh.21 


rh.22 


rh. 24 i 


AAV2/42.2 


AAV2/42.10 


AAV2/42.11 


AAV2/42.13 


AAV2/1 


1/20 


1/20 


1/20 


No NAB 


AAV2/2 


1/640 


1/1280 


1/5120 


No NAB 


AAV2/5 


No NAB 


1/40 


1/160 


No NAB 


AAV2/7 


1/81920 


1/81920 


1/40960 


1/640 


AAV2/8 


1/640 


1/640 


1/320 


1/5120 


CM. 5 AAV2/A3 


1/20- 


1/160 


1/640 


1/640 


rh.8 
AAV2/43.25 


1/20 


1/20 


1/20 


1/320 


rh. 10 

AAV2/44.2 


No NAB 


No NAB 


No NAB 


1/5120 


rh. 13 

AAV2/42.2 


1/5120 


1/5120 


1/5120 


No NAB 


rh.21 
AAV2/42.10 


1/5120 


1/10240 


1/5120 


1/20 


rh.22 
AAV2/42.11 


1/20480 


1/20480 


1/40960 


No NAB 


rh.24 
AAV2/42.13 


No NAB 


1/20 


1/20 


1/5120 
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Table lib. Cross-NAB assay in 8431 cells andAdv coinfection 



Jnfection in 8431 cells (coinfected withAdv) with: 



Serum 
from rabbit 
immunized with: 


10 9 GC 


10 10 GC 


10 10 GC 


10 9 GC 


10 9 GC 


rh.12 


ch.5 


rh.8 


rh.10 


rh.20 


AAV2/42.1B 


AAV2/A3 


AAV2/43.25 


AAV2/44.2 


AAV2/42.8.2 


AAV2/1 • 


No NAB 


1/20480 


No NAB 


1/80 


ND 


AAV2/2 


1/20 


No NAB 


No NAB 


No NAB 


ND 


AAV2/5 


No NAB 


1/320 


No NAB 


No NAB 


ND 


AAV2/7 


1/2560 


1/640 


1/160 


1/81920 


ND 


AAV2/8 


1/10240 


1/2560 


1/2560 


1/81920 


ND 


c/».5AAV2/A3 


1/1280 


1/10240 


ND 


1/5120 


1/320 


rh.S AAV2/43.25 


1/1280 


ND 


1/20400 


1/5120 


1/2560 


rh.10 AAV2/44.2 


1/5120 


ND 


ND 


1/5120 


1/5120 


rh.13 AAV2/42.2 


1/20 


ND 


ND 


No NAB 


1/320 


rh.21 AAV2/42.10 


1/20 


ND 


ND 


1/40 


1/80 


rh.22 AAV2/42.11 


No NAB 


ND 


ND 


ND 


No NAB 


/•/?.24AAV2/42.13 


1/5120 


ND 


ND 


ND 


1/2560 



Table 12 



Titer of rabbit sera 


Titer after 
Boosting 


Vector Titer d21 


ch.5 AAV2/A3 


1/10,240 


1/40,960 


rh.8 AAV2/43.25 


1/20,400 


1/163,840 


rh.10 AAV2/44.2 


1/10,240 


1/527,680 


rh.13 AAV2/42.2 


1/5,120 


1/20,960 


rh.21 AAV2/42.10 


1/20,400 


1/81,920 


rh.22 AAV2/42.11 


1/40,960 


ND 


rh.24 AAV2/42.13 


1/5,120 


ND 



Table 13 a. Infection in 8431 cells (coinfected with Adv) with GFP 





10 9 GC/well 


10 9 GC/well 


10 9 GC/well 


10 9 GC/well 


10 9 GC/well 


10 9 GC/well 












ch.5 


AAV2/1 


AAV2/2 


AAV2/5 


AAV2/7 


AAV2/8 


AAV2/A3 


# GFU/field 


128 


>200 


95 


56 


13 


1 


83 


>200 


65 


54 


11 


1 
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Table 13b. Infection in 8431 cells (coinfected with Adv) with GFP 





10 9 GC/well 


10 9 GC/well 


10 9 GC/well 


10 9 GC/well 


10 9 GC/well 


10 9 GC/well 


10° GC/well 




rh.8 


rh.10 


rh.13 


rh.21 


rh.22 


rh.24 


rh.12 




AAV2/43.25 


AAV2/44.2 


AAV2/42.2 


AAV2/42.10 


AAV2/42.11 


AAV2/42.13 


AAV2/42.1B 




3 


13 


54 


62 


10 


3 


18 


# GFU/field 


2 


12 


71 


60 


14 


2 


20 








48 


47 


16 


3 


12 



5 Example 10 - Mouse Model of Familial Hypercholesterolemia 

The following experiment demonstrates that the AAV2/7 construct of the 
invention delivers the LDL receptor and express LDL receptor in an amount sufficient to 
reduce the levels of plasma cholesterol and triglycerides in animal models of familial 
hypercholesterolemia. 

10 A. Vector Construction 

AAV vectors packaged with AAV7 or AAV8 capsid proteins were 
constructed using a pseudotyping strategy [Hildinger M 3 et al, 1 Virol 2001; 75:6199-6203]. 
Recombinant AAV genomes with AAV2 inverted terminal repeats (ITR) were packaged by 
triple transfection of 293 cells with the cu-plasmid, the adenovirus helper plasm id and a 

15 chimeric packaging construct, a fusion of the capsids of the novel AAV serotypes with the 
rep gene of AAV2. The chimeric packaging plasmid was constructed as previously described 
[Hildinger et al, cited above]. The recombinant vectors were purified by the standard CsCU 
sedimentation method. To determine the yield TaqMan (Applied Biosystems) analysis was 
performed using probes and primers targeting the SV40 poly(A) region of the vectors [Gao 

20 GP 5 et aL, Hum Gene Thar. 2000 Oct 10;11(1 5):2079-91]. The resulting vectors express the 
transgene under the control of the human thyroid hormone binding globulin gene promoter 
(TBG). 

B. Animals 

LDL receptor deficient mice on the C57B1/6 background were purchased 
25 from the Jackson Laboratory (Bar Harbor, ME, USA) and maintained as a breeding colony. 
Mice were given unrestricted access to water and obtained a high fat Western Diet (high % 
cholesterol) starting three weeks prior vector injection. At day -7 as well at day 0, blood 
was obtained via retroorbital bleeds and the lipid profile evaluated. The mice were randomly 
divided into seven groups. The vector was injected via an intraportal injection as previously 
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described ([Chen SJ etal, Mol Therapy 2000; 2(3), 256-261]. Briefly, the mice were 
anaesthetized with ketamine and xylazine. A laparotomy was performed and the portal vein 
exposed. Using a 30g needle the appropriate dose of vector diluted in lOOul PBS was 
directly injected into the portal vein. Pressure was applied to the injection site to ensure a 
5 stop of the bleeding. The skin wound was closed and draped and the mice carefully 

monitored for the following day. Weekly bleeds were performed starting at day 14 after liver 
directed gene transfer to measure blood lipids. Two animals of each group were sacrificed at 
the time points week 6 and week .12 after vector injection to examine atherosclerotic plaque 
size as well as receptor expression. The remaining mice were sacrificed at week 20 for 
1 0 plaque measurement and determination of transgene expression. 



Table 14 





Vector 


dose 


n 


Group 1 


AAV2/7-TBG-hLDLr 


lx 10 12 gc 


12 


Group 2 


AAV2/7-TBG-hLDLr 


3x 10" gc 


12 


Group 3 


A A V2/7-TBG -hLDLr 


lxl0"gc 


12 


Group 4 


AAV2/8-TBG-hLDLr 


lxl0 ,2 gc 


12 


Group 5 


AAV2/8-TBG-hLDLr 


3x 10" gc 


12 


Group 6 


AAV2/8-TBG-hLDLr 


lx 10" gc 


12 


Group 7 


AAV2/7-TBG- LacZ 


lxl0"gc 


16 



C Serum lipoprotein and liver function analysis 
15 Blood samples were obtained from the retroorbital plexus after a 6 hour 

fasting period. The serum was separated from the plasma by centrifugation. The amount of 
plasma lipoproteins and liver transaminases in the serum were detected using an automatized 
clinical chemistry analyzer (ACE, Schiapparelli Biosystems, Alpha Wassermann) 
D. Detection of transgene expression 
20 LDL receptor expression was evaluated by immuno-fluorescence stainin* 

and Western blotting. For Western Blot frozen liver tissue was homogenized with lysis 
buffer ( 20 mM Tris, pH7.4, 130mM NaCI, 1% Triton X 100, proteinase inhibitor (complete, 
EDTA-free, Roche, Mannheim, Germany). Protein concentration was determined using the 
Micro BCA Protein Assay Reagent Kit (Pierce, Rockford, IL). 40 pg of protein was resolved 
25 on 4- 1 5% Tris-HCI Ready Gels (Biorad, Hercules, CA) and transferred to a nitrocellulose 
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membrane (Invitrogen, ). To generate Anti-hLDL receptor antibodies a rabbit was injected 
intravenously with an AdhLDLr prep (lxlO 13 GC). Four weeks later the rabbit serum was 
obtained and used for Western Blot. A 1:100 dilution of the serum was used as a primary 
antibody followed by a HRP-conjugated anti-rabbit IgG and ECL chemiluminescent 
5 detection (ECL Western Blot Detection Kit, Amersham, Arlington Heights, IL). 
E . Imm uno cyto chem is try 

For determination of LDL receptor expression in frozen liver sections 
immunohistochemistry analyses were performed. lOum cryostat sections were either fixed in 
acetone for 5 minutes, or unfixed. Blocking was obtained via a 1 hour incubation period with 

10 10% of goat serum. Sections were then incubated for one hour with the primary antibody at 
room temperature. A rabbit polyclonal antibody anti-human LDL (Biomedical Technologies 
Inc., Stoughton, MA) was used diluted accordingly to the instructions of the manufacturer. 
The sections were washed with PBS, and incubated with 1:100 diluted fluorescein goat anti- 
rabbit IgG (Sigma, St Louis, MO). Specimens were finally examined under fluorescence 

15 microscope Nikon Microphot-FXA. In all cases, each incubation was followed by extensive 
washing with PBS. Negative controls consisted of preincubation with PBS, omission of the 
primary antibody, and substitution of the primary antibody by an isotype-matched non- 
immune control antibody. The three types of controls mentioned above were performed for 
each experiment on the same day. 

20 F. Gene transfer efficiency 

Liver tissue was obtained after sacrificing the mice at the designated 
time points. The tissue was shock frozen in liquid nitrogen and stored at -80°C until further 
processing. DNA was extracted from the liver tissue using a QIAamp DNA Mini Kit 
(QIAGEN GmbH, Germany) according to the manufacturers protocol. Genome copies of 

25 AAV vectors in the liver tissue were evaluated using Taqman analysis using probes and 
primers against the SV40 poly(A) tail as described above. 

G. Atherosclerotic plaque measurement 

For the quantification of the atherosclerotic plaques in the mouse 
aorta the mice were anaesthetized (10% ketamine and xylazine, ip), the chest opened and the 

30 arterial system perfused with ice-cold phosphate buffered saline through the left ventricle. 
The aorta was then carefully harvested, slit down along the ventral midline from the aortic 
arch down to the femoral arteries and fixed in formalin. The lipid-rich atherosclerotic 
plaques were stained with Sudan IV (Sigma, Germany) and the aorta pinned out flat on a 
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black wax surface. The image was captured with a Sony DXC-960 MD color video camera. 
The area of the plaque as well as of the complete aortic surface was determined using Phase 
3 Imaging Systems (Media Cybernetics). 

H. Clearance of 1 125 LDL 

Two animals per experimental group were tested. A bolus off 125 -- 
labeled LDL (generously provided by Dan Rader, U Penn) was infused slowly through the 
tail vein over a period of 30 sec (1,000,000 counts of [1 125 ]-LDL diluted in lOOjx! sterile 
PBS/ animal). At time points 3min, 30 min, 1.5hr, 3hr s 6hr after injection a blood sample 
was obtained via the retro-orbital plexus. The plasma was separated off from the whole 
blood and lOjul plasma counted in the gamma counter. Finally the fractional catabolic rate 
was calculated from the lipoprotein clearance data. 

I. Evaluation of Liver Lipid accumulation 

Oil Red Staining of frozen liver sections was performed to determine 
lipid accumulation. The frozen liver sections were briefly rinsed in distilled water followed 
by a 2 minute incubation in absolute propylene glycol. The sections were then stained in oil 
red solution (0.5% in propylene glycol) for 16 hours followed by counterstaining with 
Mayer's hematoxylin solution for 30 seconds and mounting in warmed glycerin jelly 
solution. 

For quantification of the liver cholesterol and triglyceride content liver 
sections were homogenized and incubated in.chloroform/methanol (2:1) overnight. After 
adding of 0.05% H 2 S0 4 and centrifugation for 10 minutes, the lower layer of each sample 
was collected, divided in two aliquots and dried under nitrogen. For the cholesterol 
measurement the dried lipids of the first aliquot were dissolved in 1% Triton 3C-100 in 
chloroform. Once dissolved, the solution was dried under nitrogen. After dissolving the 
lipids in ddH 2 0 and incubation for 30 minutes at 37°C the total cholesterol concentration was 
measured using a Total Cholesterol Kit (Wako Diagnostics). For the second aliquot the dried 
lipids were dissolved in alcoholic KOH and incubated at 60°C for 30 minutes. Then 1M 
MgCI2 was added, followed by incubation on ice for 10 minutes and centrifugation at 14,000 
rpm for 30 minutes. The supernatant was finally evaluated for triglycerides (Walco 
Diagnostics). 

All of the vectors pseudotyped in an AAV2/8 or AAV2/7 capsid lowered 
total cholesterol, LDL and triglycerides as compared to the control. These test vectors also 
corrected phenotype of hypercholesterolemia in a dose-dependent manner. A reduction in 
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plaque area for the AAV2/8 and AAV2/7 mice was observed in treated mice at the first test: 
(2 months), and the effect was observed to persist over the length of the experiment (6 
months). 

5 Example 1 0 - Functional Factor IX Expression and Correction of Hemophilia 
A. Knock-Out Mice 

Functional canine factor IX (FIX) expression was assessed in hemophilia B 
mice. Vectors with capsids of AAV1, AAV2, AAV5, AAV7 or AAVS were constructed to 
deliver AAV2 5' ITR - liver-specific promoter [LSP] - canine FIX - woodchuck hepatitis 
10 post-regulatory element (WPRE) - AAV2 3 5 ITR . The vectors were constructed as 
described in Wang et al, 2000 5 Molecular Therapy 2: 154-158), using the appropriate 
capsids. 

Knock-out mice were generated as described in Wang et al, 1997. Proc. Natl. 
Acad ScL USA 94: 1 1563-1 1566. This model closely mimic the phenotypes of hemophilia 
15 B in human. 

Vectors of different serotypes (AAV1, AAV2, AAVS, AAV7 and AAVS) 
were delivered as a single intraportal injection into the liver of adult hemophiliac C57BI/6 
mice in a dose of lxl 0 11 GC/mouse for the five different serotypes and one group received 
an AAVS vector at a lower dose, lxl 0 10 GC/mouse. Control group was injected with lx]0 n 
20 GC of AAV2/8 TBG LacZ3. Each group contains 5-10 male and female mice. Mice were 
bled bi-weekly after vector administration. 

1. ELISA 

The canine FIX concentration in the mouse plasma was determined 
by an ELISA assay specific for canine factor IX, performed essentially as described by 

25 Axelrod et al, 1990, Proc.NatLAcad.Sci.USA, 87:5173-51 77 with modifications. Sheep anti- 
canine factor IX (Enzyme Research Laboratories) was used as primary antibody and rabbit 
anti-canine factor IX ((Enzyme Research Laboratories) was used as secondary antibody. 
Beginning at two weeks following injection, increased plasma levels of cFIX were detected 
for all test vectors. The increased levels were sustained at therapeutic levels throughout the 

30 length of the experiment, i.e., to 12 weeks. Therapeutic levels are considered to be 5% of 
normal levels, i.e., at about 250 ng/mL. 

The highest levels of expression were observed for the AAV2/8 (at 10 n ) and 
AAV2/7 constructs, with sustained superphysiology levels cFIX levels (ten-fold higher than 
the norma! level). Expression levels for AAV2/8 (10 n ) were approximately 10 fold higher 
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than those observed for AAV2/2 and AAV2/8 (10 10 ) . The lowest expression levels, 
although still above the therapeutic range, were observed for AAV2/5. 

2. In Vifro Activated Partial Thromboplastin time (aPTT) Assay 

Functional factor IX activity in plasma of the FIX knock-out mice 
5 was determined by an in vifro activated partial thromboplastin time (aPTT) assay-Mouse 
blood samples were collected from the retro-orbital plexus into 1/10 volume of citrate buffer. 
The aPTT assay was performed as described by Wang et al, 1997, Proc. Natl: Acad. Set 
USA 94: 1 1563-11566. 

Clotting times by aPTT on plasma samples of all vector injected mice 
10 were within the normal range (approximately 60 sec) when measured at two weeks post- 
injection, and sustained clotting times in the normal or shorter than normal range throughout 
the study period (12 weeks). 

Lowest sustained clotting times were observed in the animals 
receiving AAV2/8 (10 n ) and AAV2/7. By week 12, AAV2/2 also induced clotting times 
15 similar to those for AAV2/8 and AAV2/7. However, this lowered clotting time was not 
observed for AAV2/2 until week 12, whereas lowered clotting times (in the 25 - 40 sec 
range) were observed for AAV2/8 and AAV2/7 beginning at week two. 

Immuno-histochemistry staining on the liver tissues harvested from 
some of the treated mice is currently being performed. About 70-80% of hepatocytes are 
20 stained positive for canine FIX in the mouse injected with AAV2/8.cFIX vector. 
B . Hemophilia B Dogs 

Dogs that have a point mutation in the catalytic domain of the F.IX gene, 
which, based on modeling studies, appears to render the protein unstable, suffer from 
hemophilia B [Evans et al, 1989, Proc. Natl. Acad. Sci. USA, 86:10095-10099). A colony of 
25 such dogs has been maintained for more than two decades at the University of North 

Carolina, Chapel Hill. The homeostatic parameters of these dogs are well described and 
include the absence of plasma F.IX antigen, whole blood clotting times in excess of 60 
minutes, whereas normal dogs are 6-8 minutes, and prolonged activated partial 
thromboplastin time of 50-80 seconds, whereas normal dogs are 13-28 seconds. These dogs 
30 experience recurrent spontaneous hemorrhages. Typically, significant bleeding episodes are 
successfully managed by the single intravenous infusion of 10 ml/kg of normal canine 
plasma; occasionally, repeat infusions are required to control bleeding. 
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10 



Four dogs are injected intraportally with AAV.cFIX according to the 
schedule below. A first dog receives a single injection with AAV2/2.cFIX at a dose of 
3.7x10" genome copies (GC)/kg. A second dog receives a first injection of AAV2/2.cFrX 
(2.8x10" GC/kg), followed by a second injection with AAV2/7.cFIX (2.3xl0 13 GC/kg) at 
day 1 1 80. A third dog receives a single injection with AAV2/2.cFIX at a dose of 4.6x1 0 12 
GC/kg. The fourth dog receives an injection with AAV2/2.cFIX (2.8xl0 12 GC/kg) and an 
injection at day 995 with AAV2/7.cFIX (5x1 0' 2 GC/kg). 

The abdomen of hemophilia dogs are aseptically and surgically opened under 
general anesthesia and a single infusion of vector is administered into the portal vein. The 
animals are protected from hemorrhage in the peri-operative period by intravenous 
administration of normal canine plasma. The dog is sedated, intubated to induce general 
anesthesia, and the abdomen shaved and prepped. After the abdomen is opened, the spleen is 
moved into the operative field. The splenic vein is located and a suture is loosely placed 
proximal to a small distal incision in the vein. A needle is rapidly inserted into the vein, 
15 then the suture loosened and a 5 F cannula is threaded to an intravenous location near the 
portal vein threaded to an intravenous location near the portal vein bifurcation. After 
hemostasis is secured and the catheter balloon inflated, approximately 5.0 ml of vector 
diluted in PBS is infused into the portal vein over a 5 minute interval. The vector infusion is 
followed by a 5.0 ml infusion of saline. The balloon is then deflated, the callula removed 
20 and venous hemostasis is secured. The spleen is then replaced, bleeding vessels are 

cauterized and the operative wound is closed. The animal is extubated having tolerated the 
surgical procedure well. Blood samples are analyzed as described. [Wang et al, 2000, 
Molecular Therapy 2: 154-158] 

Results showing correction or partial correction are anticipated for AAV2/7. 



25 



All publications cited in this specification are incorporated herein by 
reference. While the invention has been described with reference to a particularly preferred 
embodiments, it will be appreciated that modifications can be made without departing from 
the spirit of the invention. Such modifications are intended to fall within the scope of the 
30 claims. 
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What is claimed is: 

1. A method of detecting adeno-associated virus (AAV) sequences in a sample, 
said method comprising the steps of: 

(a) subjecting a sample containing DNA to amplification via polymerase 
chain reaction (PCR) using a first set of primers which specifically amplify a first region of 
AAV nucleic acid sequences; 

(b) optionally subjecting the DNA to further amplification using a second set 
of primers which specifically amplify a second region which comprises the first region of 
AAV sequences and sequences which are 5' to the first region, such that AAV 5' extension 
sequences which anneal to the 5' end of the AAV sequences amplified by the primers for the 
first region are obtained; 

(c) optionally subjecting the DNA to further amplification using a third set of 
primers which specifically amplify a third region which comprises the first region of AAV 
sequences and sequences which are 3' to the first region, such that AAV 3' extension 
sequences which anneal to the 3' end of the AAV sequences amplified by the primers for the 
first region are obtained, 

wherein each of said regions is predetermined based upon the 
alignment of the nucleic acid sequences of at least two AAV serotypes and each of said 
regions comprises nucleic acid sequences which are highly conserved over at least 18 base 
pairs at the 5' end, optionally variable sequences in the middle, and sequences which are 
highly conserved over at least 18 base pairs at the 3' end of the sequences of the region, 
relative to the sequences of the at least two aligned AAV serotypes; and 

wherein each of the sets of primers consist of a 5' primer and a 3' 

primer; 

wherein the presence of amplified sequences indicates the presence 
of an AAV in the sample. 

2. A method according to claim 1, wherein steps (b) and (c) are performed, and 
further comprising the step of : 
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(d) using the amplified sequences to construct a sequence comprising a 
partial and/or complete AAV gene, thereby isolating an AAV gene sequence from the 
sample. 

3. • The method according to claim 2, wherein the first region is in the AAV 
capsid, the second region extends to the 3' end of the rep genes, and the third region extends 
to 5' of the AAV 3' ITR, such that step (d) permits the construction of AAV gene sequences 
comprising the complete AAV capsid gene. 

4. The method according to claim 2, further comprising the step of subjecting 
the DNA to amplification with a fourth set of primers which specifically amplifies a fourth 
region which extend 5' to the third region. 

5. The method according to claim 4, wherein the fourth region extends to 3' of 
the AAV 5' ITR, such that step (d) permits the construction of AAV gene sequences 
comprising the complete AAV rep and cap genes. 

6. The method according to claim 1 or claim 2, wherein said sample comprises 
cellular or genomic DNA. 

7. The method according to claim 1 or claim 2, wherein said DNA has been 
extracted from the group consisting of cells, cell culture, tissue, tissue culture, and biological 
fluids. 

8. The method according to claim 1 or claim 2, wherein the first region is about 
250 base pairs in length. 

9. The method according to claim 1 or 2, wherein the first region is highly 
conserved over at least about 25 base pairs at the 5' and/or 3* end of the region. 

10. The method according to claim lor 2, wherein the first region is highly 
conserved over at least about 30 base pairs at the 5' and/or 3' end of the region. 
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1 1 . The method according to claim 1 or claim 2, wherein the highly conserved 
sequences of the first region have at least 80% identity among the aligned AAV serotypes at 
the 5' and/or 3' end of the region. 

12. The method according claim 1 1, wherein the highly conserved. sequences 
of the first region have at least 90% identity among the aligned AAV serotypes at the 5* 
and/or 3' end of the region. 

13. The method according to claim 1 or claim 2, wherein the variable 
sequences in the middle of the first region have less than 70% identity among Realigned 
AAV serotypes. 

14. The method according to claim 1 or claim 2, wherein the first region spans 
about bp 2800 to about 3200 of AAV I, SEQ ID NO:6, and corresponding base pairs in other 
AAV serotypes. 

15. The method according to claim 14, wherein the first region is 257 bp 
spanning bp 2886 to about 3143 bp of AAV 1, SEQ ID NO:6, and corresponding base pairs 
in other AAV serotypes. 

16. The method according to claim 1, wherein the primers are AVlns and 
AV2cas. 

1 7. A method of identifying the serotype of adeno-associated virus (AAV) 
sequences in a sample as known or unknown, said method comprising the steps of: 

(a) obtaining an enzymatic digestion analysis of a sample containing 
DNA molecules comprising AAV sequences which span ail or a portion of bp 2886 to 3143 
of AAV1, SEQ ID NO:l, and corresponding regions of other AAV serotypes; and 

(b) comparing the enzymatic digestion analysis from the sample to 
enzymatic digestion analysis for corresponding regions of one or more AAV serotypes, 
thereby identifying the AAV sequences in the sample as being from one of the one or more 
AAV serotypes or from an unknown serotype. 
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1 8. The method according to claim 1 7, wherein the AAV sequences were 
obtained by the method of claim 1 . 

19. A diagnostic kit for detecting the presence of an unknown adeno-associated 
virus (AAV) in a sample, said kit comprising: 

(a) a first set of primers which specifically amplify a first region of AAV 
nucleic acid sequences; 

(b) optionally a second set of primers specific for a second region of the 
AAV nucleic acid sequences which comprises the first region of AAV sequences and 
sequences which are 5' to the first region, such that AAV 5' extension sequences which 
anneal to the 5" end of the AAV sequences amplified by the primers for the first region are 
obtained; 

(c) optionally a third set of primers which specifically amplify a third region 
which comprises the first region of AAV sequences and sequences which are 3' to the first 
region, such that AAV 3' extension sequences which anneal to the 3' end of the AAV 
sequences amplified by the primers for the first region are obtained, 

wherein each of said regions is predetermined based upon the 
alignment of the nucleic acid sequences of at least two AAV serotypes and each of said 
regions comprises nucleic acid sequences which are highly conserved over at least 1 8 base 
pairs at the 5' end, optionally variable sequences in the middle, and sequences which are 
highly conserved over at least 1 8 base pairs at the 3' end of the sequences of the region, 
relative to the sequences of the at least two aligned AAV serotypes 

wherein each of the sets of primers consist of a 5' primer and a 3' 

primer. 



20. A method for isolating novel adeno-associated viruses (AAV) from a cell, said 
method comprising the steps of: (a) infecting the cell with a virus which provides helper 
functions to said AAV; (b) isolating infectious clones containing AAV; (c) sequencing the 
isolated AAV; and (d) comparing the sequences of the isolated AAV to known AAV 
serotypes, whereby differences in the sequences of the isolated AAV and known AAV 
serotypes indicates the presence of a novel AAV. 

2 1 . The method according to claim 20, wherein said virus is an adenovirus. 
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22. The method according to claim 21, wherein said adenovirus is of human or 
non-human primate origin. 

23. A novel adeno-associated virus (AAV) serotype identified by a method 
according to claim 2 or claim 20. 

24. An isolated adeno-associated virus (AAV) comprising an AAV capsid 
having an amino acid sequence selected from the group consisting of: 

AAV7, amino acids 1 to 737 of SEQ ID NO:2; CI, SEQ ID NO:60; C2, 
SEQIDNO:61; C5, SEQ ID NO:62; A3-3, SEQ ID NO:66; A3-7, SEQ ID NO:67; A3-4, 
SEQ ID NO:68; A3-5, SEQ ID NO: 69; 3.3b, SEQ ID NO: 62; 223.4, SEQ ID NO: 73; 223- 
5, SEQ ID NO:74; 223-10, SEQ ID NO:75; 223-2, SEQ ID NO:76; 223-7, SEQ ID NO: 77; 
223-6, SEQ ID NO: 78; 44-1, SEQ ID NO: 79; 44-5, SEQ ID NO:80; 44-2, SEQ ID NO:8I ; 
42-15, SEQ ID NO: 84; 42-8, SEQ ID NO: 85; 42-13, SEQ ID NO:86; 42-3A, SEQ ID 
NO:87; 42-4, SEQ ID NO:88; 42-5A, SEQ ID NO:89; 42-1B, SEQ ID NO:90; 42-5B, SEQ 
ID NO:91; 43-1, SEQ ID NO: 92; 43-12, SEQ ID NO: 93; 43-5, SEQ ID NO:94; 43-21, 
SEQ ID NO:96; 43-25, SEQ ID NO: 97; 43-20, SEQ ID NO:99; 24.1, SEQ ID NO: 101; 
42.2, SEQ ID NO: 102; 7.2, SEQ ID NO: 103; 27.3, SEQ ID NO: 104; 16.3, SEQ ID NO: 
105; 42.10, SEQ ID NO: 106; 42-3B, SEQ ID NO: 107; 42-1 1, SEQ ID NO: 108; Fl, SEQ 
ID NO: 1 09; F5, SEQ ID NO: 1 1 0; F3, SEQ ID NO: 1 1 1 ; 42-6B, SEQ ID NO: 1 1 2; and 42- 
12, SEQ ID NO: 113. 



25. A protein comprising a fragment of an AAV capsid protein, said fragment 
selected from the group consisting of: 

vp2 capsid protein, amino acids (aa) 138 to 737; 
vp3 capsid protein, aa 203 to 737; 

hypervariable region (HVR)l through 12: aa 146 to 152; aa 182 to 187; aa 
262 to 264; aa 263 to 266; aa 263 to 266; aa 381 to 383; aa 383 to 385; aa 450 to 474; aa 45 1 
to 475; aa 490 to 495; aa491 to 496; aa500 to 504; aa 501 to 505; aa 514 to 522; aa 533 to 
554; aa 534 to 555; aa 581 to 594; aa 583 to 596; aa 658 to 667; aa 660 to 669; and aa 705 tc 
719; aa 707 to 772; 
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aa 24-42, aa25 -28; aa 81 - 85; aal33 to 165; aa 134 - 165; aa 137 to 143; 
aa 154 to 156; aa 194 to 208; aa 261 to 274; aa262 to 274; aa 171 to 173; aa413 to 417; aa 
449 to 478; aa 494 to 525; aa 534 to 571; aa 581 to 601; aa 660 to 671 ; aa 709 to 723; and 

aa 1 to 1 84, aa 1 99 to 259; aa 274 to 446; aa 603 to 659; aa 670 to 706; aa 
724 to 736; aa 1 85 to 198; aa 260 to 273; aa 447 to 477; aa 495 to 602; aa 660 to 669; and aa 
707 to 723, 

wherein the amino acid numbers are those of the AAV7 capsid, SEQ ID 
NO:2, and corresponding regions in the capsid of CI, SEQ ID NO:60; C2, SEQ ID NO:61 ; 
C5,SEQIDNO:62; A3-3, SEQ ID NO:66; A3-7, SEQ IDNO:67; A3-4, SEQ ID NO:68; 
A3-5, SEQ ID NO: 69; 3.3b, SEQ ID NO: 62; 223.4, SEQ ID NO: 73; 223-5, SEQ ID 
NO:74; 223-10, SEQ ID NO:75; 223-2, SEQ ID NO:76; 223-7, SEQ ID NO: 77; 223-6, 
SEQ ID NO: 78; 44-1, SEQ ID NO: 79; 44-5, SEQ ID NO:80; 44-2, SEQ ID NO:81 ; 42-1 5, 
SEQ ID NO: 84; 42-8, SEQ ID NO: 85; 42-13, SEQ ID NO:86; 42-3A, SEQ ID NO:87; 42- 
4, SEQ ID NO:88; 42-5A, SEQ ID NO:89; 42-1B, SEQ ID NO:90; 42-5B, SEQ ID NO:91 ; 
43-1, SEQ ID NO: 92; 43-12, SEQ ID NO: 93; 43-5, SEQ ID NO:94; 43-21, SEQ ID 
NO:96; 43-25, SEQ ID NO: 97; 43-20, SEQ ID NO:99; 24.1, SEQ ID NO: 1 01 ; 42.2, SEQ 
ID NO:102; 7.2, SEQ ID NO: 103; 27.3, SEQ ID NO: 104; 16.3, SEQ ID NO: 105; 42.10, 
SEQ ID NO: 106;42-3B, SEQ ID NO: 107; 42-1 1, SEQ ID NO: 108; Fl, SEQ ID NO: 109; 
F5, SEQ ID NO: 110; F3, SEQ ID NO:l 1 1; 42-6B, SEQ ID NO: 1 12; and 42-12, SEQ ID 
NO: 113. 

26. An artificial adeno-associated virus (AAV) capsid protein comprising one or 
more of the protein fragments according to claim 25. 

27. A recombinant adeno-associated virus (AAV) comprising an artificial capsid 
according to claim 26. 

28. A molecule comprising a nucleic acid sequence encoding a protein according 
to claim 25 to 26. 

29. A molecule comprising a nucleic acid sequence encoding a novel adeno- 
associated virus (AAV) serotype capsid protein having an amino acid sequence selected from 
the group consisting of: AAV7, amino acids 1 to 737 of SEQ ID NO:2; CI, SEQ ID NO:60; 
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C2,SEQ1DN0:61; C5, SEQ ID NO:62; A3-3, SEQ IDNO:66; A3-7, SEQ ID NO:67; 
A3-4, SEQ ID NO:68; A3-5, SEQ ID NO: 69; 3.3b, SEQ ID NO: 62; 223.4, SEQ ID NO: 
73; 223-5, SEQ ID NO:74; 223-10, SEQ ID NO:75; 223-2, SEQ ID NO:76; 223-7, SEQ ID 
NO: 77; 223-6, SEQ ID NO: 78; 44-1 , SEQ ID NO: 79; 44-5, SEQ ID NO:80; 44-2, SEQ ID 
NO:81; 42-15, SEQ ID NO: 84; 42-8, SEQ ID NO: 85; 42-13, SEQ EDNO:86; 42-3A, SEQ 
ID NO:87; 42-4, SEQ ID NO:88; 42-5A, SEQ ID NO:89; 42-1B, SEQ ID NO:90; 42-5B, 
SEQ ID NO:91 ; 43-1 , SEQ ID NO: 92; 43-12, SEQ ID NO: 93; 43-5, SEQ ID NO:94; 43- 
21, SEQ ID NO:96; 43-25, SEQ ID NO: 97; 43-20, SEQ ID NO:99; 24.1, SEQ ID NO: 101 ; 
42.2, SEQ ID NO:102; 7.2, SEQ ID NO: 103; 27.3, SEQ ID NO: 104; 16.3, SEQ ID NO: 
105; 42.10, SEQ ID NO: 106; 42-3B, SEQ ID NO: 107; 42-11, SEQ ID NO: 108; Fl, SEQ 
ID NO: 109; F5, SEQ ID NO: 110; F3, SEQ ID NO:l 1 1; 42-6B, SEQ ID NO: 1 12; and 42- 
12, SEQ ID NO: 113. 

30. A molecule comprising a nucleic acid sequence encoding a novel adeno- 
associated virus (AAV) serotype capsid protein, said nucleic acid sequence selected from the 
group consisting of: AAV5, SEQ ID NO:2; 42-2 , SEQ ID NO:9; 42-8 ,SEQ ID NO:27; 42- 
15 , SEQ ID NO:28; 42-5b ,SEQ ID NO: 29; 42-lb ,SEQ ID NO:30; 42-13, SEQ ID NO: 
31; 42-3a , SEQ ID NO: 32; 42-4 ,SEQ ID NO:33; 42-5a , SEQ ID NO: 34; 42-10, SEQ ID 
NO:35; 42-3b, SEQ ID NO: 36; 42-1 1, SEQ ID NO: 37; 42-6b, SEQ ID NO:38; 43-1 , SEQ 
ID NO: 39; 43-5, SEQ ID NO: 40; 43-12 ,SEQ ID NO:41; 43-20, SEQ ID NO:42; 43-21 , 
SEQ ID NO: 43; 43-23, SEQ ID NO:44; 43-25 ,SEQ ID NO: 45; 44.1, SEQ ID NO:47; 
44.5, SEQ ID NO:47; 223.10, SEQ ID NO:48; 223.2 , SEQ ID NO:49; 223.4, SEQ ID 
NO:50; 223.5, SEQ ID NO: 51; 223.6, SEQ ID NO: 52; 223.7, SEQ ID NO: 53; A3.4, SEQ 
ID NO: 54; A3.5, SEQ ID NO:55; A3.7 , SEQ ID NO: 56; A3.3 ,SEQ ID NO:57; 42.12, 
SEQ ID NO: 58; 44.2, SEQ ID NO: 59; AAV10 , SEQ ID NO: 1 17; AAV1 1, SEQ ID NO: 
118; AAV 12 ,SEQ ID NO:l 19; A3.1, SEQ ID NO: 120; and H6, SEQ ID NO: 25. 

31. A molecule comprising a nucleic acid sequence encoding a fragment of an 
adeno-associated virus capsid protein, said nucleic acid sequence selected from the group 
consisting of: 

vpl, nt 825 to 3049; 
vp2, nt 1234 to 3049; 
vp3, nt 1434 to 3049; 
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nt 468 to 3090; and 
nt 725 to 3090, 

wherein the nucleotides numbers are of AAV7, SEQ ID NO:l and 
correspond to sequences in 42-2 , SEQ ID NO:9; 42-8 ,SEQ ID NO:27; 42-15 , SEQ ID 
NO:28; 42-5b ,SEQ ID NO: 29; 42- lb ,SEQ ID NO:30; 42-13, SEQ ID NO: 31; 42-3a , 
SEQ ID NO: 32; 42-4 ,SEQ ID NO:33; 42-5a , SEQ ID NO: 34; 42-10, SEQ ID NO:35; 42- 
3b, SEQ ID NO: 36; 424 1, SEQ ID NO: 37; 42-6b, SEQ ID NO:38; 43-1, SEQ ID NO: 39; 
43-5, SEQ ID NO: 40; 43-12 ,SEQ ID NO:41; 43-20, SEQ ID NO:42; 43-21, SEQ ID NO: 
43; 43-23, SEQ IDNO:44; 43-25 ,SEQ ID NO: 45; 44.1, SEQ IDNO:47; 44.5, SEQ ID 
NO:47; 223.10, SEQ ID NO:48; 223.2 , SEQ ID NO:49; 223.4, SEQ ID NO:50; 223.5, SEQ 
ID NO: 51; 223.6, SEQ ID NO: 52; 223.7, SEQ ID NO: 53; A3.4, SEQ ID NO: 54; A3.5, 
SEQ ID NO:55; A3.7 , SEQ ID NO: 56; A3.3 ,SEQ ID NO:57; 42.12, SEQ ID NO: 58; 
44.2, SEQ ID NO: 59; AAV 10 , SEQ ID NO: 1 17; AAV11, SEQ ID NO: 118; AAV12 
,SEQ ID NO:l 19; A3.1, SEQ ID NO:120; and H6, SEQ ID NO: 25. 

32. The molecule according to claim 26 or claim 28 to 31, wherein said 
molecule is a plasmid. 

33. The molecule according to claim 26 or claim 28 to 31, wherein said 
molecule further comprises a functional AAV rep gene. 

34. The molecule according to claim 30, wherein said nucleic acid sequence is 
the AAV7 sequence, SEQ ID NO:l . 

35. A method of generating a recombinant adeno-associated virus (AAV) 
comprising an AAV serotype capsid comprising the steps of culturing a host cell containing: 
(a) a molecule according to any of claims 26 or claim 28 to 3 1 which encodes an adeno- 
associated virus capsid; (b) a functional rep gene; (c) a minigene comprising AAV inverted 
terminal repeats (ITRs) and a transgene; and (d) sufficient helper functions to permit 
packaging of the minigene into the AAV capsid protein. 

36. A host cell transfected with an adeno-associated virus according to claim 24 or 
a molecule according to any of claims 26, or claims 28 to 31. 
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37. A composition comprising an AAV according to claim 24 or claim 27, and a 
physiologically compatible carrier. 

38. A composition comprising a molecule according to any of claims 26, or claims 
28 to 31 and a physiologically compatible carrier. 

39. A method of delivering a transgene to a cell, said method comprising the step of 
contacting the cell with an AAV according to claim 24 or claim 27, wherein said rAAV 
comprises the transgene. 

40. A molecule comprising a heterologous adeno-associated virus (AAV) 
serotype 7 nucleic acid sequence, said sequence comprising: 

nucleotides (nt) 1 to 1 07 of SEQ ID NO: 1 ; 
nt 107 to 2215 of SEQ ID NO:l; 
nt 334 to 2215 of SEQ ID NO: 1; 
nt 2222 to 4435 of SEQ ID NO: 1 . 
nt 2633 to 4435 of SEQ ID NO: 1 ; 

nt 283 1 to 443 5 of SEQ ID NO: 1; and 
nt 4704 to 472 1 of SEQ ID NO: 1 . 

41 . A molecule encoding an adeno-associated virus (AAV) serotype 7 rep 
protein or a fragment thereof, said protein or fragment selected from the group consisting of: 

amino acid (aa) 1 to 623, aa 1 to 171 ; aa 172 to 372, aa 373 to 444, and aa 
445 to 623 of SEQ ID NO:3. 

42. A host cell containing a molecule according to claim 40 or 41 . 
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?t-^ GA ATTCGCCCTT 

A GCGGCCGCGA ATTCGCCCTT 

Wr-l GA ATTCGCCCTT 

- GAA TTCGCCCTTT 

AAV1 ACCACGGGCA AGACCAACAT CGCGGAAGCC ATCGCCCACG CCGTGCCCTT 

AAV2 ACTACCGGGA AGACCAACAT CGCGGAGGCC ATAGCCCACA CTGTGCCCTT 

AAV3 ACGACGGGTA AAACCAACAT CGCGGAAGCC ATCGCCCACG CCGTGCCCTT 

AAV8 ACCACCGGCA AGACCAACAT TGCGGAAGCC ATCGCCCACG CCGTGCCCTT 

AAV9 ACCACGGGAA AGACCAACAT CGCAGAAGCC ATTGCCCACG CCGTGCCCTT 

AAV7 ACCACCGGCA AGACCAACAT TGCGGAAGCC ATCGCCCACG CCGTGCCCTT 

44 ~ 2 GA ATTCGCCCTT 
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Fig. 1AC 

1401 1450 

42_2 . CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

42_8 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

42_15 . CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

42__5b .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

42_lb 

42_13 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

4 2_3a .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

42_4 

4 2__5a .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

42JLO 

42_3b 

42_11 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

42_6b . GTCTTCCGC CCAGATCGAT CCCACCCCCG TGATCGTCAC TTCCAACACC 

4 3_1 .CTACGGCTG CATCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

43_5 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

43_12 GGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

4 3_20 .CTACGGCTG. CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

4 3_21 GGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

43_23 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

4 3__25 .CTACGGCTG- CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

4 4_1 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

4 4_5 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

22310 

223_2 [ ] [ [] 

223_4 . . . [ . . . . . 

223 5 ...... 

223_6 ' ] 

223 7 

A3_4 TCTACGGCTG CGTCAACTGG ACCAATGAAA ACTTTCCCTT CAACGATTGC 

A3_5 TCTACGGCTG CGTCAACTGG ACCAATGAAA ACTTTCCCTT CAACGATTGC 

A3_7 TCTACGGCTG CGTCAACTGG ACCAATGAAA ACTTTCCCTT CAACGATTGC 

A3_3 TCTACGGCTG CGTCAACTGG ACCAATGAAA ACTTTCCCTT CAACGATTGC 

4 2 J. 2 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

AAV1 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAATGATTGC 

AAV 2 . CTACGGGTG CGTAAACTGG ACCAATGAGA ACTTTCCCTT CAACGACTGT 

AAV 3 .CTACGGCTG CGTAAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

AAV 8 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAATGATTGC 

AAV 9 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

AAV 7 .CTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 

4 4_2 TCTACGGCTG CGTCAACTGG ACCAATGAGA ACTTTCCCTT CAACGATTGC 
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Fig. IAD 



1451 

GTCGACAAGA 
GTCGACAAGA 


1 oLj X o/i 1 b X b 

TGGTGATCTG 
^ pc^ , T , ^ , z\ ^ p^ ,r ^^ , 

X o o X itlb 

TGGTGATCTG 


b 1 bbb/lbbAb 

GTGGGAGGAG 

b X bbbRbbAb 

GTGGGAGGAG 


bbbAAbAxGA 
GGCAAGATGA 
bbbAAbAI bA 
GGCAAGATGA 


1500 
CGGCCAAGGT 
CGGCCAAGGT 
CGGCCAAGGT 
CGGCCAAGGT 


GTCGACAAGA 
GTCGACAAGA 


TGGTGATCTG 
TGGTGATCTG 


GTGGGAGGAG 
GTGGGAGGAG 


GGCAAGATGA 
GGCAAGATGA 


CGGCCAAGGT 
CGGCCAAGGT 


GTCGACAAGA 


TGGTGATCTG 


GTGGGAGGAG 


GGCAAGATGA 


CGGCCAAGGT 





GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

AACATGTGCG CCGTGATTGA CGGGAACAGC ACCACCTTCG AGCACCAGCA 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGTTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 



GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGAAAGATGA CCGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGAAAGATGA CCGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGAAAGATGA CCGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGAAAGATGA CCGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGGAAGATGA CCGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 

GTCGACAAGA TGGTGATCTG GTGGGAGGAG GGCAAGATGA CGGCCAAGGT 
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Fig. 1AE 



1501 

CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 


GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 


TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 


CAGCAAGGTG 
CAGCAAGGTG 


1550 
CGCGTGGACC 
CGCGTGGACC 


CGTGGAGTCC 
CGTGGAGTCC 


GCCAAGGCCA 
-GCCAAGGCCA 


TTCTCGGCGG 
TTCTCGGCGG 


CAGCAAGGTG 
CAGCAAGGTG 


CGCGTGGACC 
CGCGTGGACC 


CGTGGAGTCC 


GCCAAGGCCA 


TTCTCGGCGG 


CAGCAAGGTG 


CGCGTGGACC 





CGTGGAGTCC 
GCCGTTGCAG 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 



GCCAAGGCCA 
GACCGGATGT 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 



TTCTCGGCGG 
TCAAATTTGA 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 



CAGCAAGGTG 
ACTCACCCGC 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAAGTG 
CAGCAAAGTG 



CGCGTGGACC 
CGTCTGGAGC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGCGTGGACC 
CGCGTGGACC 



CGTGGAATCT 
CGTGGAATCT 
CGTGGAATCT 
CGTGGAATCT 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCG 
CGTGGAGAGC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 



GCCAAAGCCA 
GCCAAAGCCA 
GCCAAAGCCA 
GCCAAAGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAAGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 



TTCTGGGTGG 
TTCTGGGTGG 
TTCTGGGTGG 
TTCTGGGTGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGAGG 
TTCTGGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 



AAGCAAGGTT 
AAGCAAGGTT 
AAGCAAGGTT 
AGGCAAGGTT 
CAGCAAGGTG 
CAGCAAGGTG 
AAGCAAGGTG 
AAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAAGTG 



CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
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Fig. 1AF 

1551 1600 

42_2 AAAAGTGCAA GTCTTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACT 

42_8 AAAAGTGCAA GTCTTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACT 

42_15 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

42_5b AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

42__lb 

42_13 AAAAGTGCAA GTCGTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACT 

42_3a AAAAGTGCAA GTCGTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACT 

42_4 

42_5a AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

42~3b .......... 

42_11 AAAAGTGCAA GTCTTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACT 

42__6b ATGACTTTGG CAAGGTGACA AAGCAGGAAG TCAAAGAGTT CTTCCGCTGG 

4 3_1 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

43_5 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

43J.2 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

43_20 AAAAGTGCAA GTCTTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACC 

43_21 AAAAGTGCAA GTCTTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACC 

4 3_23 AAAAGTGCAA GTCTTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACC 

4 3_25 AAAAGTGCAA GTCTTCCGCC CAGATCGATC CCACCCCCGT GATCGTCACC 

4 4_1 AAAAGTGCAA GCCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

223 4 10 AAAAGTGCAA GTCGT CCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

223_2 ' ' 

223_4 ' 

223_5 ' " \[ ' ' 

223_6 

223 7 [ ..'..[.[['] '" 

A3_4 AGAAATGCAA GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

A3_5 AGAAATGCAA GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

A3_7 AGAAATGCAG GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

A3_3 AGAAATGCAA GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

42_12 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

AAV1 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

AAV2 AGAAATGCAA GTCCTCGGCC CAGATAGACC CGACTCCCGT GATCGTCACC 

AAV3 AAAAGTGCAA GTCATCGGCC CAGATCGAAC CCACTCCCGT GATCGTCACC 

AAV8 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

AAV9 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACTCCCGT GATCGTCACC 

AAV7 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

4 4_2 AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 



32/105 



WO 03/042397 



PCT/US02/33629 



Fig . 1AG 

1601 

42_2 TCCAACACCA ACATGTGCGC TGTGATTGAC GGGAACAGCA CCACCTTCGA 

42_8 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42_15 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42_5b TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42~13 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42_3a TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

4 2_5a TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42~3b 

42_11 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42_6b GCGCAGGATC ACGTGACCGA GGTGGCGCAT GAGTTCTACG TCAGAAAGGG 

4 3_1 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

4 3_5 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

43_12 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

43_20 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCG CCACCTTCGA 

43.21 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

43_23 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

43_25 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

4 4_1 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

22 3 4 10 TCCAACACCA ACATGTGCG C CGTGATTGAC GGGAACAGCA CCACCTTCGA 

223 2 .......... 

223 4 ' ' 

223 5 ' ' ] 

223_6 ' ' 

223 _7 .......... " 

A3_4 TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACCTTCGA 

A3_5 TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACCTTCGA 

A3_7 TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACCTTCGA 

A3_3 TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACCTTCGA 

42_12 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

AAV1 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

AAV2 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACTCAA CGACCTTCGA 

AAV3 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

AAV8 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

AAV9 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

AAV7 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

4 4_2 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 
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Fig. 1AH 



1651 1700 

42_2 GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

4 2_8 GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

4 2_15 GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

42_5b GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

42_lb 

42__13 GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

42_3a GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

42 4 

42__5a GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

42_10 

42 3b 



42_11 GCACCAGCAG 

4 2_6b TGGAGCCAAC 

4 3__1 GCACCAGCAG 

4 3_5 GCACCAGCAG 

43_12 GCACCAGCAG 

4 3_20 GCACCAGCAG 

4 3_2 1 GCACCAGCAG 

4 3_23 GCACCAGCAG 

43_25 GCACCAGCAG 

4 4_1 GCACCA'GCAG 

44 5 GCACCAGCAG 



CCGTTACAAG ACCGGATGTT 
AAGAGACCCG CCCCCGATGA 
CCGTTGCAGG ACCGGATGTT 
CCGTTGCAGG ACCGGATGTT 
CCGTTGCAGG ACCGGATGTT 
CCGTTGCAGG ACCGGATGTT 
CCGTTGCAGG ACCGGATGTT 
CCGTTGCAGG ACCGGATGTT 
CCGTTGCAGG ACCGGATGTT 
CCGTTGCGGG ACCGGATGTT 
CCGTTGCAGG ACCGGATGTT 



CAAATTTGAA CTCACCCGCC 
CGCGGATAAA AGCGAGCCCA 
CAAGTTCGAA CTCACCCGCC 
CAAGTTCGAA CTCACCCGCC 
CAAGTTCGAA CTCACCCGCC 
CAAATTTGAA CTCACCCGCC 
CAAATTTGAA CTCACCCGCC 
CAAATTTGAA CTCACCCGCC 
CAAATTTGAA CTCACCCGCC 
CAAGTTTGAA CTCACCCGCC 
CAAGTTTGAA CTCACCCGCC 



223_10 • 

22 3 2 

223 4 

223 5 

223 6 

223 7 

A3_4 GCACCAGCAG CCGTTGCAAG ACCGGATGTT CAAATTTGAA CTTACCCGCC 

A3_5 GCACCAGCAG CCGTTGCAAG ACCGGATGTT CAAATTTGAA CTTACCCGCC 

A3_7 GCACCAGCAG CCGTTGCAAG ACCGGATGTT CAAATTTGAA CTTACCCGCC 

A3_3 GCACCAGCAG CCGTTGCAAG ACCGGATGTT CAAATTTGAA CTTACCCGCC 

42_12 GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

AAV1 GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

AAV 2 ACACCAGCAG CCGTTGCAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

AAV 3 GCATCAGCAG CCGCTGCAGG ACCGGATGTT TGAATTTGAA CTTACCCGCC 

AAV 8 GCACCAGCAG CCTCTCCAGG ACCGGATGTT TAAGTTCGAA CTCACCCGCC 

AAV 9 GCACCAGCAG CCTCTCCAGG ACCGGATGTT TAAGTTCGAA CTCACCCGCC 

AAV 7 GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

44 2 GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTTGAA CTCACCCGCC 
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Fig. 1AI 

1701 176Q 

42_2 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

42_8 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

42_15 GTCTGGAGCA TGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

4 2_5b GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

4 2_lb 

4 2_13 GTCTGGAGCA TGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

4 42 3 4 GTCTGGAGCA TGACT TTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

42~~10 GTCTGGAGCA TGACTT TGGC AAGGCGACAA AGCAGGAAGT CAAAGAGTTC 

42~3b .......... .......... [ 

4 2_11 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

42_6b AGCGGGCCTG CCCCTCAGTC GCGGATCCAT CGACGTCAGA CGCGGAAGGA 

43_1 GTCTGGAGCA CGACTTTGGC AAGGTGACCA AGCAGGAAGT CAAAGAGTTC 

43_5 GTCTGGAGCA CGACTTTGGC AAGGTGACCA AGCAGGAAGT CAAAGAGTTC 

4 3_12 GTCTGGAGCA CGACTTTGGC AAGGTGACCA AGCAGGAAGT CAAAGAGTTC 

4 3_20 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

4 3_21 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

43_23 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

43_25 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGGGTTC 

44_1 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAGAGAGTTC 

22 3 4 10 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAGAGAGTTC 

223 2 

223 4 [[[ " 

223_5 ' ' ' [ " 

223_6 ' ' ' ] 

223_7 .......... .' 

A3_4 GTTTGGATCA TGACTTTGGC AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

A3_5 GTTTGGATCA TGACTTTGGC AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

A3_7 GTTTGGATCA TGACTTTGGC AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

A3_3 GTTTGGATCA TGACTTTGGC AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

4 2__12 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV1 GTCTGGAGCA TGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV2 GTCTGGATCA TGACTTTGGC AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

AAV3 GTTTGGACCA TGACTTTGGC AAGGTCACCA AACAGGAAGT AAAGGACTTT 

AAV8 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV9 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV7 GTCTGGAGCA CGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

4 4_2 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAGAGAGTTC 
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Fig. 1AJ 

1 7 SI 

OJ - 1800 

4 2_2 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_B TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_15 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 2_5b TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 2_.lb 

' 42_13 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_3a TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_4 

4 2_5a TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_10 

42_3b 

42_11 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_6b GCTCCGGTGG ACTTTGCCGA CAGGTACCAA AACAAATGTT CTCGTCACGC 

43_1 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 3_5 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 3_12 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 3_20 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 

4 3_21 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 

4 3_23 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 

43_25 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 

4 4_1 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCACG AGTTCTACGT 

4 4_5 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCACG AGTTCTACGT 

223_10 

223 2 

223_4 

223_5 

223_6 

223_7 . . . . 

A3_4 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

A3_5 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

A3_7 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

A3_3 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

42_12 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

AAV1 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

AAV2 TTCCGGTGGG CAAAGGATCA CGTGGTTGAG GTGGAGCATG AATTCTACGT 

AAV3 TTCCGGTGGG CTTCCGATCA CGTGACTGAC GTGGCTCATG AGTTCTACGT 

AAV8 TTCCGCTGGG CCAGTGATCA CGTGACCGAG GTGGCGCATG AGTTTTACGT 

AAV9 TTCCGCTGGG CCAGTGATCA CGTGACCGAG GTGGCGCATG AGTTTTACGT 

AAV7 TTCCGCTGGG CCAGTGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 4_2 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCACG AGTTCTACGT 
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CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGAtTAAAA 

CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 

CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 
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Fig. 1AM 

1901 1950 

42_2 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

42_8 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

42_15 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

42_5b GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

42_lb 

42_13 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

42_3a GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

424 

42 _ 10 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

42~3b \ \ \ . .......... 

42_11 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

42_6b GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAGTGTTC 

43_1 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

43_5 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

43_12 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

43_20 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

43_21 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

43_2 3 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

43_25 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

44_1 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

223 4 10 GCGGAAGGAG CTCCGGTGG A CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

223 2 

223 4 ' . . . . . . , ] ] 

223 5 

223 6 

223_7 .......... 

A3__4 GCGGA...AG CTTCGATAAA CTACGCGGGC AGGTACCAAA ACAAATGTTC 

A3_5 GCGGA ... AG CTTCGATAAA CTACGCGGAC AGGTACCAAA ACAAATGTTC 

A3_7 GCGGA... AG CTTCGATAAA CTACGCGGAC AGGTACCAAA ACAAATGTTC 

A3_3 GCGGA ... AG CTTCGATAAA CTACGCGGAC AGGTACCAAA ACAAATGTTC 

42_12 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

AAV1 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

AAV2 GCGGA... AG CTTCGATCAA CTACGCAGAC AGGTACCAAA ACAAATGTTC 

AAV 3 GCGGA... AG CACCGGCGGA CTACGCGGAC AGGTACCAAA ACAAATGTTC 

AAV3 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

AAV9 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

AAV7 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 

4 4_2 GCGGAAGGAG CTCCGGTGGA CTTTGCCGAC AGGTACCAAA ACAAATGTTC 
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Fig . IAN 

1951 

2000 

TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 

GAATTC GCCCTT GGCTGCGTC 

TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGCT TCCCTG.CAA GACATGCGAG 

GAATTC GCCCTTTCTA CGGCTGCGTC 

TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACATGCGAG 

GAATTC GCCCTTTCTA CGGCTGCGTC 

..GAATTC GCCCTTTCTA CGGCTGCGTC 

TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGCTTC AGACGCTGTT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGCTCC AGATGCTGTT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACATGCGAG 



TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG . TCG ACAATGCGAA 
TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG. TCG ACAATGCGAA 
TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG . TCG ACAATGCGAA 
TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG. TCG ACAATGCGAA 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA GACATGCGAG 
TCGTCACGTG GGCATGAATC TGATGCTGTT TCCCTG. CAG ACAATGCGAG 
TCGTCACGTG GGCATGAATC TGATGCTTTT TCCCTG . TAA AACATGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGCT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGATTC AGATGCTGTT TCCCTG.CAA AACGTGCGAG 
TCGTCACGCG GGCATGCTTC AGATGCTGTT TCCCTG.CAA AACATGCGAG 
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Fig. 1AO 

2001 

AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCGCGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
A. ACTGGACC A. .ATGAGAA CTTTCCCTTC A. A CGATTGCGTC 

AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAG CATTTGCTTC ACGCACGGGA CCAGAGACTG 

A. ACTGGACC A. .ATGAGAA CTTTCCCTTC A A CGATTGCGTC 

AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 

A. ACTGGACC A. .ATGAGAA CTTTCCCTTC A A CGATTGCGTC 

A . ACTAGACC A. .ATGAGAA CTTTCCCTTC A A CGATTGCGTC 

AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCGGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AAAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGG TCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGG TCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGG TCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 



AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AAAAAGACTG 
AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AAAAAGACTG 
AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AAAAAGACTG 
AGAATGAATC AGAATTCAAA TATCTGCTTC ACACACGGGC AAAAAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CGAGAGACTG 
AGAATGAATC AGAATTCAAA TATCTGCTTC ACTCACGGAC AGAAAGACTG 
AGAATGAATC AAATTTCCAA TGTCTGTTTT ACGCATGGTC AAAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACACACGGGG TCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACACACGGGG TCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACACACGGGG TCAGAGACTG 
AGAATGAATC AGAATTTCAA CATTTGCTTC ACGCACGGGA CCAGAGACTG 
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TCTGTCGTCA 


AAV 8 


CTCAGAGTGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


AAV 9 


CTCAGAGTGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


AAV 7 


TTTAGAGTGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC .... 


. . GGTCGTCA 


44 2 


TTCAGAATGT 


TTCCCCGGCG 


TGTCAGAATC 


TCAACC 


. .GGTCGTCA 
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Fig. 1AQ 

2101 2150 
42 2 GAAAGAGGAC GTATCGGAAA CTCTGTGCCA TTCATCATCT GCTGGGG . CG 



42 8 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTAGGG . 


CG 


42 15 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG. 


CG 


42 5b 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG . 


CG 


42 lb 


. AAGGTCGTG 


GAGTCCGCCA 


AG. . . .GCCA 


TTCATCATCT 


GCTGGGG. 


CG 


42 13 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG. 


CG 


4 2 3a 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG. 


CG 


42 4 


. AAGGTCGTG 


GAGTCCGCCA AG ... . GCCA 


TTCATCATCT 


GCTGGGG. 


CG 


42 5a 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG . 


CG 


42 10 


AA. . . . GGTC 


GTGAAGTCCG 


CCAAG GCCA 


TTCATCATCT 


GCTGGGG . 


CG 


4 2 3b 


AA. . . .GGTC 


fiTGGAGTCCG 


CCAAG GCCA 


TTCATCATCT 


GCTGGGG . 


CG 


42 11 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG. 


CG 


42 6b 




GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG. 


CG 


43 1 


C ft Z\ A Z\ A A A 7\ P 


GTATCAGAAA 


CTGTGTGCCA 


TTCATCATCT 


GCTGGGG . 


CG 


43 5 




GTATCAGAAA 


CTGTGTGCCA 


TTCATCATCT 


GCTGGGG 


CG 


43 12 




GTATCAGAAA 


CTGTGTGCCA 


TTCATCATCT 


GCTGGGG . 


CG 


43 20 


o /ii-i/i o /-\ vjj lo r\ ^ 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG . 


CG 


43 21 


if-V U 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG 


CG 


43 23 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG . 


CG 


4 3 25 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG. 


CG 


44 1 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG . 


CG 


44~~5 
22 3 10 
223 2 
22 3~4 
223~5 
223~6 
2 2 3~~7 

A3~4 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG, 


.CG 






























































GAAAAACG . . 


. TATCAGAAA 


CTTTGTTACA 


TTCATCATAT 


CATGGGA 


. AA 


A3~5 


GAAAAACG . . 


. TATCAGAAA 


CTTTGTTACA 


TTCATCATAT 


CATGGGA 


.AA 


A3 7 


GAAAAACG . . 


. TATCAGAAA 


CTTTGTTACA 


TTCATCATAT 


CATGGGA 


. AA 


A3 3 


GAAAAACG . . 


. TATCAGAAA 


CTTTGTTACA 


TTCATCATAT 


CATGGGA 


. AA 


42_12 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG 


.CG 


AAV1 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCCA 


TTCATCATCT 


GCTGGGG 


.CG 


.AAV 2 


AAAAGGCG. . 


. TATCAGAAA 


CTGTGCTACA 


TTCATCATAT 


CATGGGA 


.AA 


AAV 3 


AAAAGAAGAC 


TTATCAGAAA 


CTGTGTCCAA 


TTCATCATAT 


CCTGGGA 


.AG 


AAV 8 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG 


.CG 


AAV 9 


GAAAGAGGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGG 


.CG 


AAV 7 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGCGCGA 


TTCATCATCT 


GCTGGGG 


.CG 


44 2 


GAAAAAAGAC 


GTATCGGAAA 


CTCTGTGCGA 


TTCATCATCT 


GCTGGGGGCG 
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2151 

GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCACCCGAG 
GGCACCCGAG 
GGCACCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCACCCGAG 
GGCACCCGAG 



ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 



CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 



TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAC 



2200 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTAG 
GTGGACCTAG 



AGAACCAGAC 
AGTACCAGAC 
AGTACCAGAC 
AGTACCAGAC 
GGCTCCCGAG 
GGCTCCCGAG 
GGTGCCAGAC 
GGCACCCGAG 
GGCTCCCGAG 
GGCTCCCGAG 
GGCGCCCGAG 
GGCACCCGAG 



. . . GCCTGCA 
. . . GCCTGCA 
. . . GCCTGCA 
. . . GCCTGCA 
ATTGCTTGCT 
ATTGCTTGCT 
. . . GCTTGCA 
ATTGCCTGTT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 
ATTGCTTGCT 



CTGCCTGCGA 
CTGCCTGCGA 
CTGCCTGCGA 
CTGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CTGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 
CGGCCTGCGA 



CCTGGTAAAT 
CCTGGTAAAT 
CCTGGTAAAT 
CCTGGTAAAT 
TCTGGTCAAC 
TCTGGTCAAC 
TCTGGTCAAT 
TTTGGCCAAT 
TCTGGTCAAC 
TCTGGTCAAC 
CCTGGTCAAC 
TCTGGTCAAC 



GTGGACTTGG 
GTGGACTTGG 
GTGGACTTGG 
GTGGACTTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGATTTGG 
GTGGACTTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTGG 
GTGGACCTAG 
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2201 



2250 



Rep 78 stop vpl start 

42_2 ATGACCGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

42_8 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 2 J. 5 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 2_5b ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 2_lb ATGACTGTGT- TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 2_13 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

42_3a ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

42__4 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 2_5a ATGACTGTGT 'TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 2_10 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

42_3b ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

42_11 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

42_6b ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

43_1 ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

43_5 ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 3_12 ACGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 3_20 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 3_21 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 3_23 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

43_25 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 4__1 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 4__5 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

223^10 - 

223 2 

223 4 

223^5 

223_6 

223 7 

A3__4 ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

A3_5 ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

A3J7 ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

A3_3 ATGACTGTAT TTCTGAGCAA TAAATGACTT AAATCAGGTA TGGCTGCTGA 

42_12 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

AAV1 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

AAV2 ATGACTGCAT CTTTGAACAA TAAATGATTT AAATCAGGTA TGGCTGCCGA 

AAV3 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCTGA 

AAV8 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

AAV 9 ATGACTGTGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

AAV 7 ACGACTGCGT TTCTGAGCAA TAAATGACTT AAACCAGGTA TGGCTGCCGA 

4 4_2 ATGACTGTGT TTCTGAGCAA TAAA TGACTT AAACCAGGT A TG GCTGCCGA 

Rep78 stop vpl start 
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Fig. 1AT 

2251 2300 

Rep 6 8 stop 

42_2 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_8 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_15 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_5b TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_lb TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_13 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42__3a TGGTCATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_4 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_5a TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_10 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_3b TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_11 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

42_6b TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

4 3_1 TGGTTATCTT CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 

43_5 TGGTTATCTT CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 

43_12 TGGTTATCTT CCAGATTGGC TTGAGGACAA CCTCTCTGAG GGCATTCGCG 

43_20 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

43_21 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

4 3_23 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

4 3_25 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

4 4_1 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

4 4_5 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

223_10 

223 2 ' 

223_4 ' ' ' " 

223_5 [ . * \ [ 

223_6 ' . [ ' \ 

223_7 .......... 

A3_4 CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 

A3_5 CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 

A3_7 CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 

A3_3 CGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATCAGAC 

42_12 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATCCGCG 

AAV1 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

AAV2 TGGTTATCTT CCAGATTGGC TCGAGGACAC TCTCTCTGAA GGAATAAGAC 

AAV 3 CGGTTATCTT CCAGATTGGC TCGAGGACAA CCTTTCTGAA GGCATTCGTG 

AAV 8 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

AAV 9 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

AAV7 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

4 4_2 TGGTTATCTT CCAGATTGGC TCGAGGACAA CCTCTCTGAG GGCATTCGCG 

Rep 68 stop 
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Fig. 1AU 



2 301 2350 
AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 



AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 
AGTGGTGGGA 



CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAGACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CCTGAAACCT 
CCTGAAACCT 
CCTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 
CTTGAAACCT 



GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 
GGAGCCCCGA 



AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 
AACCCAAAGC 



CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 
CAACCAGCAA 



AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CGCCGAAACC TAACCAACAA 

AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 

AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AGCCCAAAGC CAACCAGCAA 

AGTGGTGGAA GCTCAAACCT GGCCCACCAC CACCAAAGCC CGCAGAGCGG 

AGTGGTGGGC TCTGAAACCT GGAGTCCCTC AACCCAAAGC GAACCAACAA 

AGTGGTGGGC GCTGAAACCT GGAGCCCCGA AGCCCAAAGC CAACCAGCAA 

AGTGGTGGGC GCTGAAACCT GGAGCCCCGA AGCCCAAAGC CAACCAGCAA 

AGTGGTGGGA CCTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 

AGTGGTGGGA CTTGAAACCT GGAGCCCCGA AACCCAAAGC CAACCAGCAA 
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2351 

AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 
AAGCAGGACG 



ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 
ACGGCCGGGG 



Fig. 1AV 



TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 
TCTGGTGCTT 



CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 
CCTGGCTACA 



2400 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 
AGTACCTCGG 



CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 
CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 
CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 
CACCGGGACG ACAGTAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 
AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 
AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 
CATAAGGACG ACAGCAGGGG TCTTGTGCTT CCTGGGTACA AGTACCTCGG 
CACCAGGACA ACCGTCGGGG TCTTGTGCTT CCGGGTTACA AATACCTCGG 
AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 
AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 
AAGCAGGACA ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 
AAGCAGGACG ACGGCCGGGG TCTGGTGCTT CCTGGCTACA AGTACCTCGG 
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Fig. 1AW 
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ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGAGAGCC GGTCAACGAG GCAGACGCCG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 
ACCCTTCAAC GGACTCGACA AGGGGGAGCC CGTCAACGCG GCGGACGCAG 



ACCCTTCAAC 
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ACCCTTCAAC 
ACCCTTCAAC 
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GGTCAACGAG 
GGTCAACGAG 
GGTCAACGAG 
GGTCAACGAG 
GGTCAACGAG 
CGTCAACGCG 
GGTCAACGAG 
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GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCAGACGCCG 
GCGGACGCAG 
GCAGACGCCG 
GCGGACGCGG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
GCGGACGCAG 
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42 2 


CGGCCCTCGA 


GCACG 


. ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


42 8 


CGGCCCTCGA 


GCACG 


. ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


42 15 


CGGCCCTCGA 


GCACG 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


4 2~~5b 


CGGCCCTCGA 


GCACG 


.ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


42 lb 


CGGCCCTCGA 


GCACG 


.ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


42 13 


CGGCCCTCGA 


GCACG 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


42 3a 


CGGCCCTCGA 


GCACG 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


42 4 


CGGCCCTCGA 


GCACG 


.ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


42 5a 


CGGCCCTCGA 


GCACG 


.ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


42 10 


CGGCCCTCGA 


GCACG, 


.ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


42 3b 


CGGCCCTCGA 


GCACG. 


.ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


42 11 


CGGCCCTCGA 


GCACG , 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


42 6b 


CGGCCCTCGA 


GCACG , 


.ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


43__1 


CGGCCCTCGA 


GCACG . 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


43__5 


CGGCCCTCGA 


GCACG , 


. ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


43_12 


CGGCCCTCGA 


GCACG , 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


4 3_20 


CGGCCCTCGA 


GCACG , 


.ACAA 


AGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


4 3_21 


CGGCCCTCGA 


GCACG , 


.ACAA 


AGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


4 3_23 


CGGCCCTCGA 


GCACG, 


.ACAA 


AGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


43 25 


CGGCCCTCGA 


GCACG , 


.ACAA 


AGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


44 1 


CGGCCCTCGA 


GCACG . 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


44 5 


CGGCCCTCGA 


GCACG , 


.ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


223 10 








GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


223 2 








GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


223 4 








GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


223 5 








GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


223 6 






, . CAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


223 7 








GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


A3 4 


CGGCCCTCGA 


GCACG . 


ACAA 


AGCCTACGAC 


CACCAGCTCA 


AGCAAGGGGA 


A3 5 


CGGCCCTCGA 


GCACG . 


.ACAA 


AGCCTACGAC 


CACCAGCTCA 


AGCAAGGGGA 


A3 7 


CGGCCCTCGA 


GCACG . 


ACAA 


AGCCTACGAC 


CACCAGCTCA 


AGCAAGGGGA 


A3 3 


CGGCCCTCGA 


GCACG . 


ACAA 


AGCCTACGAC 


CACCAGCTCA 


AGCAAGGGGA 


42 12 


CGGCCCTCGA 


GCACG . 


ACAA 


GGCCTACGAC 


AAGCAGCTCG 


AGCAGGGGGA 


AAV1 


CGGCCCTCGA 


GCACG. 


ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


AAV2 


CGGCCCTCGA 


GCACGTACAA 


AGCCTACGAC 


CGGCAGCTCG 


ACAGCGGAGA 


AAV 3 


CAGCCCTCGA 


ACACG . 


ACAA 


AGCTTACGAC 


CAGCAGCTCA 


AGGCCGGTGA 


AAV 8 


CGGCCCTCGA 


GCACG . 


ACAA 


GGCCTACGAC 


CAGCAGCTGC 


AGGCGGGTGA 


AAV 9 


CGGCCCTCGA 


GCACG . 


GCAA 


GGCCTACGAC 


CAGCAGCTGC 


AGGCGGGTGA 


AAV 7 


CGGCCCTCGA 


GCACG . 


ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 


44 2 


CGGCCCTCGA 


GCACG . 


ACAA 


GGCCTACGAC 


CAGCAGCTCA 


AAGCGGGTGA 
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42 2 


CAACCCGTAC 


CTCAAGTACA 


42 8 


CAATCCGTAC 


CTGCGGTATA 


42 15 


CAATCCGTAC 


CTGCGGTATA 


42 5b 


CAACCCGTAC 


CTCAAGTACA 


42 lb 


CAACCCGTAC 


CTCAAGTACA 


42 13 


CAATCCGTAC 


CTGCGGTATA 


42 3a 


CAATCCGTAC 


CTGCGGTATA 


42 4 


CAACCCGTAC 


CTCAAGTACA 


42 5a 


CAACCCGTAC 


CTCAAGTACA 


42 10 


CAACCCGTAC 


CTCAAGTACA 


42 3b 


CAACCCGTAC 


CTCAAGTACA 


42 11 


CAATCCGTAC 


CTGCGGTATA 


42 6b 


CAACCCGTAC 


CTCAAGTACA 


' 43 1 


CAATCCGTAC 


CTGCGGTATA 


43 5 


CAATCCGTAC 


CTGCGGTATA 


43 12 


CAATCCGTAC 


CTGCGGTATA 


43 20 


CAATCCGTAC 


CTGCGGTATA 


43 21 


CAATCCGTAC 


CTGCGGTATA 


43 23 


CAATCCGTAC 


CTGCGGTATA 


43 25 


CAATCCGTAC 


CTGCGGTATA 


44 1 


CAATCCGTAC 


CTGCGGTATA 


44 5 


CAATCCGTAC 


CTGCGGTATA 


223 10 


CAATCCGTAC 


CTGCGGTATA 


223 2 


CAATCCGTAC 


CTGCGGTATA 


223 4 


CAATCCGTAC 


CTGCGGTATA 


223 5 


CAATCCGTAC 


CTGCGGTATA 


223 6 


CAATCCGTAC 


CTGCGGTATA 


223 7 


CAATCCGTAC 


CTGCGGTATA 


A3 4 


CAACCCGTAC 


CTCAAATACA 


A3 5 


CAACCCGTAC 


CTCAAATACA 


A3 7 


CAACCCGTAC 


CTCAAATACA 


A3 3 


CAACCCGTAC 


CTCAAATACA 


42_12 


CAACCCGTAC 


CTCAAGTACA 


AAV1 


CAATCCGTAC 


CTGCGGTATA 


AAV 2 


CAACCCGTAC 


CTCAAGTACA 


AAV 3 


CAACCCGTAC 


CTCAAGTACA 


AAV 8 


CAATCCGTAC 


CTGCGGTATA 


AAV 9 


CAATCCGTAC 


CTGCGGTATA 


AAV 7 


CAATCCGTAC 


CTGCGGTATA 


44 2 


CAATCCGTAC 


CTGCGGTATA 



Pig. 1AY 
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ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ATCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGTGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCGGA CGCTGAATTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCGGAGTTT CAGGAGCGCC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
ACCACGCCGA CGCCGAGTTT CAGGAGCGTC 
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42 2 


TTCAAGAAGA 


TACGTCTTTT 


42 8 


TGCAAGAAGA 


TACGTCTTTT 


42 15 


TGCAAGAAGA 


TACGTCTTTT 


42 5b 


TTCAAGAAGA 


TACGTCTTTT 


42 lb 


TTCAAGAAGA 


TACGTCTTTT 


42 13 


TTCAAGAAGA 


TACGTCTTTT 


42 3a 


TTCAAGAAGA 


TACGTCTTTT 


42 4 


TTCAAGAAGA 


TACGTCTTTT 


42 5a 


TTCAAGAAGA 


TACGTCTTTT 


42 10 


TTCAAGAAGA 


TACGTCTTTT 


42 3b 


TTCAAGAAGA 


TACGTCTTTT 


42 11 


TTCAAGAAGA 


TACGTCTTTT 


42 6b 


TTCAAGAAGA 


TACGTCTTTT 


43 1 


TGCAAGAAGA 


TACGTCTTTT 


43 5 


TGCAAGAAGA 


TACGTCTTTT 


43 12 


TGCAAGAAGA 


TACGTCTTTT 


43 20 


TGCAAGAAGA 


TACGTCTTTT 


43 21 


TGCAAGAAGA 


TACGTCTTTT 


43 23 


TGCAAGAAGA 


TACGTCCTTT 


43 25 


TGCAAGAAGA 


TACGTCTTTT 


44 1 


TGCAAGAAGA 


TACGTCTTTT 


44 5 


TGCAAGAAGA 


TACGTCTTTT 


223 10 


TTCAAGAAGA 


TACGTCTTTT 


223 2 


TTCAAGAAGA 


TACGTCTTTT 


223 4 


TTCAAGAAGA 


TACGTCTTTT 


223 5 


TTCAAGAAGA 


TACGTCTTTT 


223 6 


TTCAAGAAGA 


TACGTCTTTT 


223 7 


TTCAAGAAGA 


TACGTCTTTT 


A3 4 


TTCAAGAAGA 


TACGTCTTTC 


A3 5 


TTCAAGAAGA 


TACGTCTTTC 


A3 7 


TTCAAGAAGA 


TACGTCTTTC 


A3 3 


TTCAAGAAGA 


TACGTCTTTC 


42 12 


TTCAAGAAGA 


TACGTCTTTT 


AAV1 


TGCAAGAAGA 


TACGTCTTTT 


AAV 2 


TTAAAGAAGA 


TACGTCTTTT 


AAV3 


TTCAAGAAGA 


TACGTCTTTT 


AAV 8 


TGCAAGAAGA 


TACGTCTTTT 


AAV 9 


TGCAAGAAGA 


TACGTCTTTT 


AAV7 


TGCAAGAAGA 


TACGTCATTT 


44 2 


TGCAAGAAGA 


TACGTCTTTT 



Fig. 1AZ 
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GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCGG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGACGAGC AGTCTTCCAG 
GGGGGCAACC TTGGCAGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
GGGGGCAACC TCGGGCGAGC AGTCTTCCAG 
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42 2 


GCCAAGAAGC 


GGGTTCTCGA 


42 8 


GCCAAGAAGC 


GGGTTCTCGA 


42 15 


GCCAAGAAGC 


GGGTTCTCGA 


42 5b 


GCCAAGAAGC 


GGGTTCTCGA 


42 lb 


GCCAAGAAGC 


GGGTTCTCGA 


42 13 


GCCAAGAAGC 


GGGTTCTCGA 


42_3a 


GCCAAGAAGC 


GGGTTCTCGA 


42 4 


GCCAAGAAGC 


GGGTTCTCGA 


42 5a 


GCCAAGAAGC 


GGGTTCTCGA 


42 10 


GCCAAGAAGC 


GGGTTCTCGA 


42 3b 


GCCAAGAAGC 


GGGTTCTCGA 


42J.1 


GCCAAGAAGC 


GGGTTCTCGA 


42 6b 


GCCAAGAAGC 


GGGTTCTCGA 


43 1 


GCCAAGAAGC 


GGGTTCTCGA 


43 5 


GCCAAGAAGC 


GGGTTCTCGA 


43 12 


GCCAAGAAGC 


GGGTTCTCGA 


43 20 


GCCAAGAAGC 


GGGTTCTCGA 


43 21 


GCCAAGAAGC 


GGGTTCTCGA 


43 23 


GCCAAGAAGC 


GGGTTCTCGA 


43 25 


GCCAAGAAGC 


GGGTTCTCGA 


44 1 


GCCAAGAAGC 


GGGTTCTCGA 


44 5 


GCCAAGAAGC 


GGGTTCTCGA 


223 10 


GCCAAAAAGC 


GGGTTCTCGA 


223 2 


GCCAAAAAGC 


GGGTTCTCGA 


223 4 


GCCAAAAAGC 


GGGTTCTCGA 


223 5 


GCCAAAAAGC 


GGGTTCTCGA 


223 6 


GCCAAAAAGC 


GGGTTCTCGA 


223 7 


GCCAAAAAGC 


GGGTTCTCGA 


A3 4 


GCCAAAAAGA 


GGGTACTCGA 


A3 5 


GCCAAAAAGA 


GGGTACTCGA 


A3 7 


GCCAAAAAGA 


GGGTACTCGA 


A3 3 


GCCAAAAAGA 


GGGTACTCGA 


4 2J.2 


GCCAAGAAGC 


GGGTTCTCGA 


AAV1 


GCCAAGAAGC 


GGGTTCTCGA 


AAV 2 


GCGAAAAAGA 


GGGTTCTTGA 


AAV 3 


GCCAAAAAGA 


GGATCCTTGA 


AAV 8 


GCCAAGAAGC 


GGGTTCTCGA 


AAV 9 


GCCAAGAAGC 


GGGTTCTCGA 


AAV 7 


GCCAAGAAGC 


GGGTTCTCGA 


44 2 


GCCAAGAAGC 


GGGTTCTCGA 



Fig. 1AAA 
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ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
ACCTCTTGGT CTGGTTGAGA CGCCAGCTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCTGTTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTGGGC CTGGTTGAGG AACCTGTTAA 
GCCTCTTGGT CTGGTTGAGG AAGCAGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 
ACCTCTCGGT CTGGTTGAGG AAGGCGCTAA 



53/105 



WO 03/042397 



PCT/US02/33629 



2651 
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42 2 


GACGGCTCCT 


GGAAAGAAGA 


42 8 


GACGGCTCCT 


GGAAAGAAGA 


42 15 


GACGGCTCCT 


GGAAAGAAGA 


42 5b 


GACGGCTCCT 


GGAAAGAAGA 


4 2 lb 


GACGGCTCCT 


GGAAAGAAGA 


42 13 


GACGGCTCCT 


GGAAAGAAGA 


42 3a 


GACGGCTCCT 


GGAAAGAAGA 


42 4 


GACGGCTCCT 


GGAAAGAAGA 


42 5a 


GACGGCTCCT 


GGAAAGAAGA 


42 10 


GACGGCTCCT 


GGAAAGAAGA 


42 3b 


GACGGCTCCT 


GGAAAGAAGA 


42 11 


GACGGCTCCT 


GGAAAGAAGA 


42 6b 


GACGGCTCCT 


GGAAAGAAGA 


43 1 


GACGGCTCCT 


GGAAAGAAGA 


43 5 


GACGGCTCCT 


GGAAAGAAGA 


4 3 12 


GACGGCTCCT 


GGAAAGAAGA 


43 20 


GACGGCTCCT 


GGAAAGAAGA 


43 21 


GACGGCTCCT 


GGAAAGAAGA 


43 23 


GACGGCTCCT 


GGAAAGAAGA 


43 25 


GACGGCTCCT 


GGAAAGAAGA 


44 1 


GACGGCTCCT 


GGAAAGAAGA 


44 5 


GACGGCTCCT 


GGAAAGAAGA 


223 10 


GACGGCACCT 


GGAAAGAAGC 


223 2 


GACGGCACCT 


GGAAAGAAGC 


223 4 


GACGGCACCT 


GGAAAGAAGC 


223 5 


GACGGCACCT 


GGAAAGAAGC 


223~6 


GACGGCACCT 


GGAAAGAAGC 


223 7 


GACGGCACCT 


GGAAAGAAGC 


A3 4 


GACGGCTCCT 


GGAAAAAAGA 


A3 5 


GACGGCTCCT 


GGAAAAAAGA 


A3 7 


GACGGCTCCT 


GGAAAAAAGA 


A3 3 


GACGGCTCCT 


GGAAAAAAGA 


42 12 


GACGGCTCCT 


GGAAAGAAGA 


AAV1 


GACGGCTCCT 


GGAAAGAAAC 


AAV 2 


GACGGCTCCG 


GGAAAAAAGA 


AAV 3 


AACGGCTCCT 


GGAAAGAAGG 


AAV 8 


GACGGCTCCT 


GGAAAGAAGA 


AAV 9 


GACGGCTCCT 


GGAAAGAAGA 


AAV7 


GACGGCTCCT 


GCAAAGAAGA 


44 2 


GACGGCTCCT 


GGAAAGAAGA 




vp2 start 
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GACCCATAGA . . .ATCCCCC 

GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA . . . ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCCATAGA . . . ATCCCCC 

GACCCATAGA . . . ATCCCCC 

GACCCATAGA ...ATCCCCC 

GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCT CAGCGTTCCC 
GACCGGTAGA GCCATCACCT CAGCGTTCCC 
GACCGGTAGA GCCATCACCT .CAGCGTTCCC 
GACTGGTAGA GCAGTCGCCA CAAGAG . . . C 
GACCGGTAGA GCAGTCGCCA CAAGAG . . . C 
GACCGGTAGA GCAGTCGCCA CAAGAG . . . C 
GACCGGTAGA GCAGTCGCCA CAAGAG. . .C 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCGGTAGA CTCGCCA 

GACCTATAGA GCAGTCTCCT GCAGAA . . .C 
GACCTATAGA GCAGTCTCCT GCAGAA... C 
GACCTATAGA GCAGTCTCCT GCAGAA. . .C 
GACCTATAGA GCAGTCTCCT GCAGAA. . . C 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GTCCGGTAGA GCAGTCGCCA CAAGAG . . . C 
GGCCGGTAGA GCACTCTCCT GTGGAG...C 
GGGCTGTAGA TCAGTCTCCT CAGGAA. ..C 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
GACCGGTAGA GCCGTCACCT CAGCGTTCCC 
GACCGGTAGA GCCATCACCC CAGCGTTCTC 
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97m 


42 2 


GACTCCTC 


42~~8 


CAGACTCCTC 


42 Is 


CAGACTCCTC 


42~5b 


CAGACTCCTC 


42~lb 


GACTCCTC 


42~13 


GACTCCTC 


42~3a 


. . GACTCCTC 


42 4 


GACTCCTC 


4 2 5a 


GACTCCTC 


42~10 


. . GACTCCTC 


42~3b 


. . GACTCCTC 


42~11 


. . GACTCCTC 


4 2 6b 


CAGACTCCTC 


43 1 


CCGACTCCTC 


4 3~5 


CCGACTCCTC 


4 3 12 


CCGACTCCTC 


4 3~20 


CAGACTCCTC 


4 3~21 


CAGACTCCTC 

v^x x V_J 1 x \_> JL n-/ JL V—* 


4 3~23 


CAGACTCCTC 


4 3~25 


CAGACTCCTC 


44" 1 


CAGACTCCTC 


4 4~5 


CAGACTCCTC 


223 10 


. . GACTCCAC 


223 2 


. . GACTCCAC 


223 4 


. . GACTCCAC 


223 5 


. . GACTCCAC 


223~6 


. . GACTCCAC 


2?3~~7 

C— ~J 1 


GACTCCAC 


nj 1 


CGGACTPTTr 








lubrtL lLx IL 




Pf^naPTPTTP 


/I 0 1 O 


LAbAU J. 1 L, 


AAV1 


CAGACTCCTC 


AAV 2 


CAGACTCCTC 


AAV 3 


CGGACTCATC 


AAV 8 


CAGACTCCTC 


AAV 9 


CAGACTCCTC 


AAV7 


CCGACTCCTC 


44 2 


CAGACTCCTC 



Fig. 1AAC 



CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGACAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAGGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGACAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
TACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCAAGAAAG 
CTCGGGCATC GGCGAATCAG 
CTCGGGCATC GGCAAATCAG 
CTCGGGCATC GGCAAATCAG 
CTCGGGCATC GGCAAATCAG 
TACGGGCATC GGCAAGACAG 
CTCGGGCATC GGCAAGACAG 
CTCGGGAACC GGAAAGGCGG 
ATCTGGTGTT GGCAAATCGG 
TACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 
CACGGGCATC GGCAAGAAAG 
TACGGGCATC GGCAAGAAAG 



2750 

GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCACCAGCC CGCGAGAAAG 
GCCACCAGCC CGCGAGAAAG 
GCCACCAGCC CGCGAGAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCTAAGAAA 
GCCAGCAGCC CGCGAAAAAG 
GCCAGCAGCC CGCTAAAAAG 
GCCAGCAGCC TGCAAGAAAA 
GCAAACAGCC TGCCAGAAAA 
GCCAACAGCC CGCCAGAAAA 
GCCAACAGCC CGCCAGAAAA 
GCCAGCAGCC CGCCAGAAAG 
GCCAGCAGCC CGCGAAAAAG 
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42 2 


AAGCTCAACT 


42~8 


AGACTCAACT 


42 15 


AGACTCAACT 


42~5b 


AGACTCAACT 


42~lb 


AGACTCAACT 


42~13 


AAGCTCAACT 


42_3a 


AAGCTCAACT 


42 4 


AAGCTCAACT 


42 5a 


AAGCTCAACT 


42 10 


AAGCTCAACT 


42 3b 


AAGCTCAACT 


42 11 


AAGCTCAACT 


42 6b 


AGACTCAACT 


43 1 


AGACTGAACT 


4 3™ 5 


AGACTGAACT 


43 12 


AGACTGAACT 


43 20 


AGACTCAATT 


43 21 


AGACTCAATT 


43 23 


AGACTCAATT 


43 25 


AGACTCAATT 


44_1 


AGACTCAACT 


44 5 


AGACTCAACT 


223 10 


AGACTCAACT 


223 2 


AGACTCAACT 


223~~4 


AGACTCAACT 


223~5 


AGACTCAACT 


223~6 


AGACTCAACT 


223 7 


AGACTCAACT 


A3 4 


AGACTCAATT 


A3 5 


AGACTCAATT 


A3~7 


AGACTCAATT 


A3~3 


AGACTCAATT 


4? 7? 


APAPTPA APT 


AAV1 


AGACTCAATT 


AAV 2 


AGATTGAATT 


AAV 3 


AGACTAAATT 


AAV 8 


AGACTCAATT 


AAV 9 


AGACTCAATT 


AAV7 


AGACTCAATT 


44 2 


AGACTCAACT 



Fig. 1AAD 



TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCG 
TTGGGCAGAC TGGCGACTCG 
TTGGGCAGAC TGGCGACTCG 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGTCAGAC TGGCGACACA 
TTGGGCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGAGACGCA 
TCGGTCAGAC TGGAGACTCA 
TTGGTCAGAC TGGCGACTCA 
TTGGTCAGAC TGGCGACTCA 
TCGGTCAGAC TGGCGACTCA 
TTGGGCAGAC TGGCGACTCA 



2800 

GAGTCAGTGC CCGACCCCCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCCCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTCC CCGACCCACA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGCCAGTCC CCGACCCTCA 
GAGCCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTCC CAGGCCCTCA 
GAGTCAGTGC CCGACCCTCA 
GAGTCAGTCC CCGATCCACA 
GACTCAGTAC CTGACCCCCA 
GAGTCAGTCC CAGACCCTCA 
GAGTCAGTTC CAGACCCTCA 
GAGTCAGTTC CAGACCCTCA 
GAGTCAGTCC CCGACCCTCA 
GAGTCAGTGC CCGACCCTCA 
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vp3 w start 

ACCTCTCGGA GAACCTCCCG CCGCGCCCTC AGGTCTGGGA TCTGGTACAA^ 



ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGCACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCTCTCGGA GAACCTCCCG CCGCGCCCTC AGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 
ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 
ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 
ACCTCTCGGA GAACCTCCAG CAGCCCCCTC AGGTCTGGGA CCTAATACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA "TCTGGTACAA 
ACCAATCGGA GAACCACCAG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 
ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 
ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 
ACCAATCGGA GAACCCCCCG CAGCCCCCTC TGGTGTGGGA TCTAATACAA 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 
ACCTCTCGGA GAACCTCCAG CAACCCCCGC TGCTGTGGGA CCTACTACAA 
GCCTCTCGGA CAGCCACCAG CAGCCCCCTC TGGTCTGGGA ACTAATACGA 
ACCTCTCGGA GAACCACCAG CAGCCCCCAC AAGTTTGGGA TCTAATACAA 
ACCTCTCGGA GAACCTCCAG CAGCGCCCTC TGGTGTGGGA CCTAATACAA 
ACCTCTCGGA GAACCTCCAG CAGCGCCCTC TGGTGTGGGA CCTAATACAA 
ACCTCTCGGA GAACCTCCAG CAGCGCCCTC TAGTGTGGGA TCTGGTACAG 
ACCAATCGGA GAACCCCCCG CAGGCCCCTC TGGTCTGGGA TCTGGTACAA 





vp3 start 
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2851 

^3 start codon 

42_2 TGGCTGCAGG CGGTGGCGCA 

4 2__8 TGGCTGCAGG CGGTGGCGCT 

42_15 TGGCTGCAGG CGGTGGCGCT 

42_5b TGGCTGCAGG CGGTGGCGCT 

42_lb TGGCTGCAGG CGGTGGCGCT 

42_13 TGGCTGCAGG CGGTGGCGCT 

42_3a TGGCTGCAGG CGGTGGCGCT 

42_4 TGGCTGCAGG CGGTGGCGCT 

4 2_5a TGGCTGCAGG CGGTGGCGCA 

42_10 TGGCTGCAGG CGGTGGCGCT 

4 2_3b TGGCTGCAGG CGGTGGCGCT 

4 2_11 TGGCTGCAGG CGGTGGCGCT 

42_6b TGGCTGCAGG CGGTGGCGCT 

4 3_1 TGGCTGCAGG CGGTGGCGCT 

43__5 TGGCTGCAGG CGGTGGCGCT 

4 3_12 TGGCTGCAGG CGGTGGCGCT 

43_20 TGGCTTCAGG CGGTGGCGCT 

4 3_2i TGGCTTCAGG CGGTGGCGCT 

43_23 TGGCTTCAGG CGGTGGCGCT 

43_25 TGGCTTCAGG CGGTGGCGCT 

4 4_1 TGGCTGCAGG CGGTGGCGCT 

4 4_5 TGGCTGCAGG CGGTGGCGCT 

223_10 TGGCTGCAGG CGGTGGCGCA 

223_2 TGGTTGCAGG CGGTGGCGCA 

223_4 TGGCTGCAGG CGGTGGCGCA 

223_5 TGGCTGCAGG CGGTGGCGCA 

223_6 TGGCTGCAGG CGGTGGCGCA 

223_7 TGGCTGCAGG CGGTGGCGCA 

A3_4 TGGCTTCAGG CGGTGGGGCA 

A3_5 TGGCTTCAGG CGGTGGGGCA 

A3_7 TGGCTTCAGG CGGTGGGGCA 

A3_3 TGGCTTCAGG CGGTGGGGCA 

42__12 TGGCTGCAGG CGGTGGCGCT 

AAV1 TGGCTTCAGG CGGTGGCGCA 

AAV2 TGGCTACAGG CAGTGGCGCA 

AAV3 TGGCTTCAGG CGGTGGCGCA 

AAV 8 TGGCTGCAGG CGGTGGCGCA 

AAV 9 TGGCTGCAGG CGGTGGCGCA 

AAV 7 TGGCTGCAGG CGGTGGCGCA 

4 4_2 TGGCTGCAGG CGGTGGCGCT 
vp3 start codon (cont'< 



Fig. 1AAF 

2900 

CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA .AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCTG ACAATAGCGA GGGCGCCGAC 
CCAATGGCTG ACAATAACGA GGGCGCCGAC 
CCAATGGCAG ACGATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA GGGCGCCGAC 
CCAATGGCAG ACAATAACGA GGGTGCCGAT 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
CCAATGGCAG ACAATAACGA AGGTGCCGAC 
CCAATGGCAG ACAATAACGA AGGCGCCGAC 
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42 2 


GGAGTGGGTA 


ATGCCTCCGG 


42 8 


GGAGTGGGTA 


GTTCCTCAGG 


42 15 


GGAGTGGGTA 


GTTCCTCAGG 


42 5b 


GGAGTGGGTA 


GTTCCTCAGG 


42 lb 


GGAGTGGGTA 


GTTCCTCAGG 


42 13 


GGAGTGGGTA 


GTTCCTCAGG 


42 3a 


GGAGTGGGTA 


GTTCCTCAGG 


42 4 


GGAGTGGGTA 


ATGCCTCCGG 


42 5a 


GGAGTGGGTA 


ATGCCTCCGG 


42 10 


GGAGTGGGTA 


ATGCCTCCGG 


42 3b 


GGAGTGGGTA 


ATGCCTCCGG 


42 11 


GGAGTGGGTA 


ATGCCTCCGG 


42 6b 


GGAGTGGGTA 


GTTCCTCAGG 


43 1 


GGAGTGGGTA 


GTTCCTCAGG 


43 5 


GGAGTGGGTA 


GTTCCTCAGG 


43 12 


GGAGTGGGTA 


GTTCCTCAGG 


43 20 


GGAGTGGGTA 


ATTCCTCGGG 


43 21 


GGAGTGGGTA 


ATTCCTCGGG 


43 23 


GGAGTGGGTA 


ATTCCTCGGG 


43 25 


GGAGTGGGTA 


ATTCCTCGGG 


44 1 


GGAGTGGGTA 


GTTCCTCAGG 


44 5 


GGAGTGGGTA 


GTTCCTCAGG 


223 10 


GGAGTGGGTA 


ATGCCTCAGG 


223 2 


GGAGTGGGTA 


ATGCCTCAGG 


223 4 


GGAGTGGGTA 


ATGCCTCAGG 


223 5 


GGAGTGGGTA 


ATGCCTCAGG 


223 6 


GGAGTGGGTA 


ATGCCTCAGG 


223~7 


GGAGTGGGTA 


ATGCCTCAGG 


A3~4 


GGAGTGGGTA 


ATTCCTCGGG 


A3~5 


GGAGTGGGTA 


ATTCCTCGGG 


A3~7 


GGAGTGGGTA 


ATTCCTCGGG 


A3~3 


GGAGTGGGTA 


ATTCCTCGGG 


42 12 


GGAGTGGGTA 


GTTCCTCAGG 


AAV1 


GGAGTGGGTA 


ATGCCTCAGG 


AAV 2 


GGAGTGGGTA 


ATTCCTCCGG 


AAV 3 


GGAGTGGGTA ATTCCTCAGG 


AAV 8 


GGAGTGGGTA 


GTTCCTCGGG 


AAV 9 


GGAGTGGGTA 


ATTCCTCGGG 


AAV 7 


GGAGTGGGTA 


ATGCCTCAGG 


AAV10 


GGTA 


ATTCCTCCGG 


AAV11 


GGTA 


ATTCCTCCGG 


AAV12 


GGTA 


ATTCCTCCGG 


44 2 


GGAGTGGGTA 


GTTCCTCAGG 



Fig. 1AAG 

2950 

AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATAGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CACGGCTGGG 
AAATTGGCAT TGCGATTCCA CACGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGATGGG 
AAATTGGCAT TGCGATTCCC AATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
AAATTGGCAT TGCGATTCCA CATGGCTGGG 
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2951 

CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
GGACAGAGTC 
GGACAGAGTC 
GGACAGAGTC 
GGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
CGACAGAGTC 
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3001 3050 

42 2 ACAACCACCT CTACAAGCAG ATATCAA. . G TCAGAGCGGG GCT....ACC 
4 2~8 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

4 2__15 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

4 2~~5b ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

4 2~lb ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

42JL3 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

42 3a ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

42_4 ACAACCACCT CTACAAGCAG ATATCAA... . GTCAGAGCG GGGC..TACC 

42_5a ACAACCACCT CTACAAGCAG ATATCAA... .GTCAGAGCG GGGC . . TACC 

42~10 ACAACCACCT CTACAAGCAG ATATCAA.. G TCAGAGCGGG GCTA . . . . CC 

42_3b ACAACCACCT CTACAAGCAG ATATCAA G TCAGAGCGGG GCTA....CC 

42JL1 ACAACCACCT CTACAAGCAG ATATCAA. . G TCAGAGCGGG GCTA....CC 

42~6b ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

43 1 ACAACCATCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACT 
4 3~5 ACAACCATCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACT 

43JL2 ACAACCATCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACT 

4 3~20 ACAACCACCT CTACAAGCAA ATCTCCAACG GCACCTCGGG AGGAAGCACC 

4 3~21 ACAACCACCT CTACAAGCAA ATCTCCAACG GCACCTCGGG AGGAAGCACC 

43~23 ACAACCACCT CTACAAGCAA ATCTCCAACG GCACCTCGGG AGGAAGCACC 

4 3~25 ACAACCACCT CTACAAGCAA ATCTCCAACG GCACCTCGGG AGGAAGCACC 

44_1 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACTTCGGG AGGAAGCACC 

44~5 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACTTCGGG AGGAAGCACC 

223_10 ACAACCACCT CTACAAGCAA ATCTCCAGTC AGTCAGCAGG GAG...CACC 

223 2 ACAACCACCT CTACAAGCAA ATCTCCAGTC AGTCAGCAGG GAG...CACC 

223~4 ACAACCACCT CTACAAGCAA ATCTCCAGTC AGTCAGCAGG GAG...CACC 

223~5 ACAACCACCT CTACAAGCAA ATCTCCAGTC AGTCAGCAGG GAG . . . CACC 

223~6 ACAACCACCT CTACAAGCAA ATCTCCAGTC AGTCAGCAGG GAG... CACC 

223J7 ACAACCACCT CTACAAGCAA ATCTCCAGTC AGTCAGCAGG GAG... CACC 

A3~4 ATAATCACCT CTACAAGCAA ATCTCCA. . . GCGAATCGGG AGC...CACC 

A3~5 ATAATCACCT CTACAAGCAA ATCTCCA... GCGAATCGGG AGC . . . CACC 

A3~7 ATAATCGCCT CTACAAGCAA ATCTCCA... GCGAATCGGG AGC ... CACC 

A3~3 ATAATCACCT CTACAAGCAA ATCTCCA.:. GCGAATCGGG AGC... CACC 

4 2 12 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAAGCACC 

AAV1 ATAACCACCT CTACAAGCAA ATCTCCAGTC CTTCAACGGG . GG . . CCAGC 

AAV2 ACAACCACCT CTACAAACAA ATTTCCA. . . GCCAATCAGG AGC . . . CTCG 

AAV3 ACAACCATCT CTACAAGCAA ATCTCCA. . . GCCAATCAGG AGC. . . TTCA 

AAV8 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACATCGGG AGGAGCCACC 

AAV9 ACAACCACCT CTACAAGCAA ATCTCCAATG GAACATCGGG AGGAAGCACC 

AAV 7 ACAACCACCT CTACAAGCAA ATCTCCAGTG AAACTGCAGG TAG... TACC 

AAV10 ACAACCACAT CTACAAGCAA ATCTCCAGCG AGACAGGAGC CACCAACGAC 

AAV11 ACAACCACCT CTACAAACAA ATCTCCAGCG CTTCAACGGG GGCCAGCAAC 

AAV 12 ACAACCACCT CTACAAGCAA ATCTCCAGCC AATCGGGTGC CACCAACGAC 

44 2 ACAACCACCT CTACAAGCAA ATCTCCAACG GGACTTCGGG AGGAAGCACC 



61/105 



WO 03/042397 



PCT/US02/33629 





3051 




42 2 


AACGACAACC 


ACTTCTTCGG 


42 8 


AACGACAACA 


CCTACTTCGG 


42 15 


AACGACAACA 


CCTACTTCGG 


42 5b 


AACGACAACA 


CCTACTTCGG 


42 lb 


AACGACAACA 


CCTACTTCGG 


42 13 


AACGACAACA 


CCTACTTCGG 


42 3a 


AACGACAACA 


CCTACTTCGG 


42 4 


AACGACAACC 


ACTTCTTCGG 


42 5a 


AACGACAACC 


ACTTCTTCGG 


42 10 


AACGACAACC 


ACTTCTTCGG 


42 3b 


AACGACAACC 


ACTTCTTCGG 


42 11 


AACGACAACC 


ACTTCTTCGG 


42 6b 


AACGACAACA 


CCTACTTCGG 


43 1 


AACGACAACA 


CCTACTTTGG 


43 5 


AACGACAACA 


CCTACTTTGG 


43 12 


AACGACAACA 


CCTACTTTGG 


43 20 


AACGACAACA 


CCTATTTTGG 


43 21 


AACGACAACA 


CCTATTTTGG 


43 23 


AACGACAACA 


CCTATTTTGG 


43 25 


AACGACAACA 


CCTATTTTGG 


44 1 


AACGACAACA 


CCTACTTCGG 


44 5 


AACGACAACA 


CCTACTTCGG 


223 10 


AACGATAACG 


TCTATTTCGG 


223 2 


AACGATAACG 


TCTATTTCGG 


223 4 


AACGATAACG 


TCTATTTCGG 


223 5 


AACGATAACG 


TCTATTTCGG 


223 6 


AACGATAACG 


TCTATTTCGG 


223 7 


AACGATAACG 


TCTATTTCGG 


A3 4 


AACGACAACC 


ACTACTTCGG 


A3 5 


AACGACAACC 


ACTACTTCGG 


A3 7 


AACGACAACC 


ACTACTTCGG 


A3 3 


AACGACAACC 


ACTACTTCGG 


42 12 


AACGACAACA 


CCTACTTCGG 


AAV1 


AACGACAACC 


ACTACTTCGG 


AAV 2 


AACGACAATC 


ACTACTTTGG 


AAV 3 


AACGACAACC 


ACTACTTTGG 


AAV 8 


AACGACAACA 


CCTACTTCGG 


AAV 9 


AACGACAACA 


CCTACTTTGG 


AAV7 


AACGACAACA 


CCTACTTCGG 


AAV10 


AACCACTACT 


TCGGCTACAG 


AAV11 


. . . GACAACC 


ACTACTTTGG 


AAV12 


AACCACTACT 


TCGGCTA. . . 


44 2 


AACGACAACA 


CCTACTTCGG 



Fig. 1AAJ 

3100 

CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGCT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGATTT 
CTACAGCACC CCTTGGGGGT ATTTTGACTT 
CTACAGCACC CCTTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 

C ACC CCCTGGGGGT ATTTTGACTT 

CTACAGCACC CCCTGGGGGT ATTTTGACTT 
. . . CAGCACC CCTTGGGGGT ATTTTGATTT 
CTACAGCACC CCCTGGGGGT ATTTTGACTT 
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Fig. 1AAK 



CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCATCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGTCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CATTGCCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGTCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCACT TTTCACCACG 
CACTGCCACT TCTCACCACG 
CACTGCCATT TCTCACCACG 
CACTGCCACT TCTCACCACG 



3150 

TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAA CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTTATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAA AGACTCATCA 
TGACTGGCAG CGACTCATTA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
TGACTGGCAG CGACTCATCA 
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Fig . 1AAL 

3200 

CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAGAAAGC TGCGGTTCAA GTTGTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAAAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAAAGAC TCAACTTCAA GCTGTTCAAC 
.CCCAAAAGAC TCAACTTCAA GCTGTTCAAC 
CCCAAAAGAC TCAACTTCAA GCTGTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC CCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC - 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAGC TCAACTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAAAC TCAATTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAACTTCAA ACTCTTCAAC 
CCCAAGAGAC TCAACTTCAA GCTCTTTAAC 
CCCAAGAAAC TCAGCTTCAA GCTCTTCAAC 
CCCAAGAGAC TCAGCTTCAA GCTCTTCAAC 
CCAAAGAGAC TCAACTTCAA GCTGTTCAAC 
CCCAAGAAGC TGCGGTTCAA GCTCTTCAAC 



CCCAAGAGAC TCAACTTCAA GCTCTTCAAC 
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Fig. 1AAM 



AGGAGGTCAC GACGAACGAC 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GACGAACGAC 

AGGAGGTCAC GACGAACGAC 

AGGAGGTCAC GACGAACGAC 

AGGAGGTCAC GACGAACGAC 

AGGAGGTCAC GACGGACGAC 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAAGTCAC GACGAACGAA 

AGGAAGTCAC GACGAACGAA 

AGGAAGTCAC GACGAACGAA 

AGGAAGTCAC GACGAACGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GACGAATGAC 

AGGAGGTCAC GACGAATGAC 

AGGAGGTCAC GACGAATGAC 

AGGAGGTCAC GACGAATGAC 

AGGAGGTCAC GACGAATGAC 

AGGAGGTCAC GACGAATGAC 

AGGAGGTCAC GCAGAATGAT 

AGGAGGTCAC GCAGAATGAT 

AGGAGGTCAC GCAGAATGAT 

AGGAGGTCAC GCAGAATGAT 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTCAC GACGAATGAT 

AAGAGGTCAC GCAGAATGAC 

GAGGGGTCAC GCAGAACGAT 

AGGAGGTCAC GCAGAATGAA 

AGGAGGTTAC GACGAACGAA 

AGGAGGTCAC GACGAATGAC 

AGGAGGTCAC GCAGAATGAA 



3250 

GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCTAA 
GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGTGTCACAA CCATCGCTAA 
GGTGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGTGTCACAA CCATCGCTAA 
GGCGTCACAA CCATCGCTAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGAACCACGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTCACAA CCATCGCTAA 
GGTACGACGA CGATTGCCAA 
GGCACGACGA CTATTGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCACCAAGA CCATCGCCAA 
GGCGTTACGA CCATCGCTAA 
GGCACCAAGA CCATCGCCAA 
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42_2 
4 2_8 
42_15 
42_5b 
42_lb 
42_13 
42_3a 
42__4 
42_5a 
42_10 
42_3b 
42JL1 
42_6b 
4 3_1 
4 3_5 
43_12 
43_20 
43_21 
43_23 
4 3_25 
44_1 
4 4_5 
223_10 
223_2 
223_4 
223_5 
223__6 
223_7 
A3_4 
A3_5 
A3J7 
A3_3 
42_12 
AAV1 
AAV2 
AAV 3 
AAV 8 
AAV 9 
AAV 7 
44 2 



3251 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAATCTCACC 

TAATCTCACC 

TAATCTCACC 

TAATCTCACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 

TAACCTCACC 

TAACCTTACC 

TAACCTTACC 

TAACCTTACC 



AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC. 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACGATTC 

AGCACCGTGC 

AGCACCGTGC 

AGCACCGTGC 

AGCACCGTGC 

AGCACGATTC 

AGCACGATTC 

AGCACGGTTC 

AGCACGGTTC 

AGCACGGTTC 

AGCACGGTTC 

AGCACGGTTC 

AGCACGGTTC 

AGCACGGTGC 

AGCACGGTGC 

AGCACGGTGC 

AGCGCGGTGC 

AGCACGATTC 

AGCACGGTTC 

AGCACGGTTC 

AGCACGGTTC 

AGCACCATCC 

AGCACCGTCC 

AGCACGATTC 

AGCACGATTC 



Fig. 1AAN 



AGGTCTTCTC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTCTC 

AGGTCTTCTC 

AGGTCTTCTC 

AGGTCTTCTC 

AGGTCTTCTC 

AGGTGTTTAC 

AGGTGTTTAC 

AGGTGTTTAC 

AGGTCTTTAC 

GGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTAC 

AGGTCTTTTC 

AGGTCTTTTC 

AGGTCTTTTC 

AGGTCTTTTC 

AGGTCTTTTC 

AGGTCTTTTC 

AGGTCTTCAC 

AGGTCTTCAC 

AGGTCTTCAC 

AGGTCTTCAC 

AGGTCTTTAC 

AAGTCTTCTC 

AGGTGTTTAC 

AAGTGTTTAC 

AGGTGTTTAC 

AGGTCTTTAC 

AGGTATTCTC 

AGGTCTTTAC 



GGACTCGGAG 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAG 

GGACTCGGAG 

GGACTCGGAG 

GGACTCGGAG 

GGACTCGGAG 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAG 

GGACTCGGAG 

GGACTTGGAG 

GGACTCGGAG 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACTCGGAA 

GGACCCGGAA 

AGACTCTGAG 

AGACTCTGAG 

AGACTCTGAG 

AGACTCTGAG 

GGACTCGGAA 

GGACTCGGAG 

TGACTCGGAG 

GGACTCGGAG 

GGACTCGGAG 

GGACTCGGAG 

GGACTCGGAA 

GGACTCGGAA 



3300 
TACCAACTGC 
TACCAGCTCC 
TACCAGCTCC 
TACCAGCTCC 
TACCAGCTCC 
TACCAGCTCC 
TACCAGCTCC 
TACCGGCTCC 
TACCAACTGC 
TACCAACTGC 
TACCAACTGC 
TACCAACTGC 
TACCAACTGC 
TACCAGCTCC 
TACCAGCTCC 
TACCAGCTCC 
TACCAGTTAC 
TACCAGTTAC 
TACCAGTTAC 
TACCAGTTAC 
TACCAGCTCC 
TACCAGCTCC 
TATCAACTGC 
TATCAACTGC 
TATCAACTGC 
TATCAACTGC 
TATCAACTGC 
TATCAACTGC 
TACCAGCTGC 
TACCAGCTGC 
TACCAGCTGC 
TACCAGCTGC 
TACCAGCTCC 
TACCAGCTTC 
TACCAGCTCC 
TATCAGCTCC 
TACCAGCTGC 
TACCAGCTAC 
TACCAGCTGC 
TACCAGCTCC 
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3301 


42 2 


CGTACGTCCT 


42 8 


CGTACGTCCT 


42 15 


CGTACGTCCT 


42 5b 


CGTACGTCCT 


42 lb 


CGTACGTCCT 


42 13 


CGTACGTCCT 


42 3a 


CGTACGTCCT 


42 4 


CGTACGTCCT 


42 5a 


CGTACGTCCT 


42 10 


CGTACGTCCT 


42 3b 


CGTACGTCCT 


42 1 


CGTACGTCCT 


42 6b 


CGTACGTCCT 


43 1 


CGTACGTCCC 


43 5 


CGTACGTCCT 


43 12 


CGTACGTCCT 


43 20 


CGTACGTGCT 


43 21 


CGTACGTGCT 


43 23 


CGTACGTGCT 


43 25 


CGTACGTGCT 


44 1 


CGTACGTCCT 


44 5 


CGTACGTCCT 


223 10 


CGTACGTCCT 


223 2 


CGTACGTCCT 


223 4 


CGTACGTCCT 


223 5 


CGTACGTCCT 


223 6 


CGTACGTCCT 


223 7 


CGTACGTCCT 


A3 4 


CCTACGTCCT 


A3 5 


CGTACGTCCT 


A3 7 


CCTACGTCCT 


A3 3 


CCTACGTCCT 


42__12 


CGTACGTCCT 


AAV1 


CGTACGTCCT 


AAV2 


CGTACGTCCT 


AAV3 


CGTACGTGCT 


AAV8 


CGTACGTTCT 


AAV9 


CGTACGTCCT 


AAV7 


CGTACGTCCT 


44 2 


CGTACGTCCT 



Fig. 1AAO 



CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
AGGATCCGCT CACCAGGGAT 
AGGATCCGCT CACCAGGGAT 
AGGATCCGCT CACCAGGGAT 
AGGATCCGCT CACCAGGGAT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGCTCCGCG CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGTTCGGCT CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCGGCG CATCAAGGAT 
CGGGTCGGCG CACCAAGGCT 
CGGCTCTGCC CACCAGGGCT 
AGGCTCTGCC CACCAAGGAT 
CGGCTCTGCG CACCAGGGCT 
CGGCTCTGCG CACCAGGGCT 



3350 

GCCTCCCTCC GTTCCCTGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCCGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCTGCG 
GCCTCCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GTCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTGCCTCC GTTCCCGGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTTCCGCC GTTCCCAGCA 
GCCTGCCTCC GTTCCCGGCG 
GCCTCCCTCC GTTCCCGGCG 
GCCTCCCGCC GTTCCCAGCA 
GTCTCCCGCC GTTTCCAGCG 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCACC GTTTCCTGCA 
GCCTGCCTCC GTTCCCGGCG 
GCCTGCCTCC GTTCCCGGCG 
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3351 



42 2 


GACGTGTTCA 


TGATTCCTCA 


42 8 


GACGTCTTCA 


TGATTCCTCA 


42 15 


GACGTCTTCA 


TGATTCCTCA 


42 5b 


GACGTCTTCA 


TGATTCCTCA 


42_lb 


GACGTCTTCA 


TGATTCCTCA 


42_13 


GACGTCTTCA 


TGATTCCTCA 


42_3a 


GACGTCTTCA 


TGATTCCTCA 


42 4 


GACGTCTTCA 


TGATTCCTCA 


42_5a 


GACGTGTTCA 


TGATTCCTCA 


42_10 


GACGTGTTCA 


TGATTCCTCA 


42 3b 


GACGTGTTCA 


TGATTCCTCA 


42 1 


GACGTGTTCA 


TGATTCCTCA 


42 6b 


GACGTGTTCA 


TGATTCCTCA 


43 1 


GACGTCTTCA 


TGATTCCTCA 


43 5 


GACGTCTTCA 


TGATTCCTCA 


43_12 


GACGTCTTCA 


TGATTCCTCA 


43__20 


GACGTCTTCA 


CGGTTCCTCA 


43_21 


GACGTCTTCA 


TGGTTCCTCA 


4 3__23 


GACGTCTTCA 


TGGTTCCTCA 


4 3_25 


GACGTCTTCA 


TGGTTCCTCA 


4 4_1 


GACGTCTTCA 


TGATTCCTCA 


4 4_5 


GACGTCTTCA 


TGATTCCTCA 


223_10 


GACGTGTTCA 


TGATTCCGCA 


223_2 


GACGTGTTCA 


TGATTCCGCA 


223 4 


GACGTGTTCA 


TGATTCCGCA 


223 5 


GACGTGTTCA 


TGATTCCGCA 


223 6 


GACGTGTTCA 


TGATTCCGCA 


223 7 


GACGTGTTCA 


TGATTCCGCA 


A3 4 


GACGTCTTCA 


TGATTCCTCA 


A3 5 


GACGTCTTCA 


TGATTCCTCA 


A3 7 


GACGTCTTCA 


TGATTCCTCA 


A3 3 


GACGTCTTCA 


TGATTCCTCA 


42 12 


GACGTCTTCA 


TGATTCCTCA 


AAV1 


GACGTGTTCA 


TGATTCCGCA 


AAV2 


GACGTCTTCA 


TGGTGCCACA 


AAV3 


GACGTCTTCA 


TGGTCCCTCA 


AAV8 


GACGTGTTCA 


TGATTCCCCA 


AAV 9 


GACGTCTTCA 


TGGTTCCTCA 


AAV 7 


GACGTCTTCA 


TGATTCCTCA 


44 2 


GACGTCTTCA 


TGATTCCTCA 



Fig. 1AAP 

3400 

GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGATAT CTGACTCTAA ACAACGGCAG 
GTACGGGTAT CTGACCCTAA ACAATGGCAG 
GTACGGGTAT CTGACCCTAA ACAATGGCAG 
GTACGGGTAT CTGACCCTAA ACAATGGCAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGCTAT TTAACTTTAA ACAATGGAAG 
GTACGGGTAC CTGACTCTGA ACAATGGCAG 
GTACGGGTAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGATAC CTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGCTAC TTGACTCTGA ACAATGGCAG 
GTACGGGTAC CTGACTCTGA ACAACGGCAG 
ATACGGCTAC CTGACGCTCA ACAATGGCAG 
GTATGGATAC CTCACCCTGA ACAACGGCAG 
GTATGGATAC CTCACCCTGA ACAACGGAAG 
GTACGGCTAC CTAACACTCA ACAACGGTAG 
GTACGGCTAC CTGACGCTCA ACAATGGAAG 
GTACGGCTAC CTGACTCTCA ACAATGGCAG 
GTACGGGTAC CTGACTCTGA ACAATGGCAG 
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3401 




42 2 


TCAGTCTGTG 


GGACGTTCCT 


42 8 


TCAGGCCGTG 


GGCCGTTCCT 


42 15 


TCAGGCCGTG 


GGCCGTTCCT 


42 5b 


TCAGGCCGTG 


GGCCGTTCCT 


42 lb 


TCAGGCCGTG 


GGCCGTTCCT 


42 13 


TCAGGCCGTG 


GGCCGTTCCT 


42 3a 


TCAGGCCGTG 


GGCCGTTCCT 


42 4 


TCAGGCCGTG 


GGCCGTTCCT 


42 5a 


TCAGTCTGTG 


GGACGTTCCT 


42 10 


TCAGTCTGTG. 


GGACGTTCCT 


42 3b 


TCAGTCTGTG 


GGACGTTCCT 


42 11 


TCAGTCTGTG 


GGACGTTCCT 


42 6b 


TCAGTCTGTG 


GGACGTTCCT 


43 1 


TCAGGCTGTG 


GGCCGTTCCT 


43 5 


TCAGGCTGTG 


GGCCGTTCCT 


43 12 


TCAGGCTGTG 


GGCCGTTCCT 


43 20 


CCAAGCCCTG 


GGACGTTCCT 


43 21 


CCAAGCCCTG 


GGACGTTCCT 


43 23 


CCAAGCCCTG 


GGACGTTCCT 


43 25 


CCAAGCCCTG 


GGACGTTCCT 


44 1 


TCAGGCCGTG 


GGCCGTTCCT 


44 5 


TCAGGCCGTG 


GGCCGTTCCT 


223 10 


CCAATCGGTA 


GGCCGTTCCT 


223 2 


CCAATCGGTA 


GGCCGTTCCT 


223 4 


CCAATCGGTA 


GGCCGTTCCT 


223 5 


CCAATCGGTA 


GGCCGTTCCT 


223 6 


CCAATCGGTA 


GGCCGTTCCT 


223 7 


CCAATCGGTA 


GGCCGTTCCT 


A3 4 


CCAAGCGGTA 


GGACGTTCTT 


A3 5 


CCAAGCGGTA 


GGACGTTCTT 


A3 7 


CCAAGCGGTA 


GGACGTTCTT 


A3 3 


CCAAGCGGTA 


GGACGTTCTT 


42_12 


TCAGGCCGTG 


GGCCGTTCCT 


AAV1 


CCAAGCCGTG 


GGACGTTCAT 


AAV 2 


TCAGGCAGTA 


GGACGCTCTT 


AAV 3 


TCAAGCGGTG 


GGACGCTCAT 


AAV 8 


TCAGGCCGTG 


GGACGCTCCT 


AAV9 


TCAAGCGTTA 


GGACGTTCTT 


AAV 7 


TCAGTCTGTG 


GGACGTTCCT 


44 2 


TCAGGCCGTG 


GGCCGTTCCT 



Fig. 1AAQ 

3450 

CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAATAC TTCCCTTCTC 
CCTTCTACTG CCTGGAATAC TTCCCTTCTC 
CCTTCTACTG CCTGGAATAC TTCCCTTCTC 
CCTTCTACTG TCTGGAGTAT TTCCCATCGC 
CCTTCTACTG TCTGGAGTAT TTCCCATCGC 
CCTTCTACTG TCTGGAGTAT TTCCCATCGC 
CCTTCTACTG TCTGGAGTAT TTCCCATCGC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CATTCTACTG TCTAGAGTAT TTTCCCTCTC 
CATTCTACTG TCTAGAGTAT TTTCCCTCTC 
CATTCTACTG TCTAGAGTAT TTTCCCTCTC 
CATTCTACTG TCTAGAGTAT TTTCCCTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTTTACTG CCTGGAATAT TTCCCTTCTC 
CATTTTACTG CCTGGAGTAC TTTCCTTCTC 
CCTTTTACTG CCTGGAGTAC TTCCCTTCGC 
CCTTCTACTG CCTGGAATAC TTTCCTTCGC 
CTTTCTACTG TCTGGAATAC TTCCCTTCTC 
CCTTCTACTG CCTGGAGTAC TTCCCCTCTC 
CCTTCTACTG CCTGGAGTAC TTTCCTTCTC 
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42 2 


AGATGCTGAG 


42 8 


AAATGCTGAG 


42 15 


AAATGCGGAG 


42 5b 


AAATGCTGAG 


42 lb 


AAATGCTGAG 


42 13 


AAATGCTGAG 


42 3a 


AAATGCTGAG 


42 4 


AAATGCTGAG 


42 5a 


AGATGCTGAG 


42 10 


AGATGCTGAG 


42 3b 


AGATGCTGAG 


42 11 


AGATGCTGAG 


42 6b 


AGATGCTGAG 


43 1 


AAATGCTGAG 


43 5 


AAATGCTGAG 


43 12 


AAATGCTGAG 


43 20 


AGATGCTGAG 


43 21 


AGATGCTGAG 


43 23 


AGATGCCGAG 


43 25 


AGATGCTGAG 


44 1 


AAATGCTGAG 


44 5 


AAATGCTGAG 


223 10 


AGATGCTGAG 


223 2 


AGATGCTGAG 


223 4 


AGATGCTGAG 


223 5 


AGATGCTGAG 


223 6 


AGATGCTGAG 


223 7 


AGATGCTGAG 


A3 4 


AGATGCTGAG 


A3 5 


AGATGCTGAG 


A3 7 


AGATGCTGAG 


A3 3 


AGATGCTGAG 


42 12 


AAATGCTGAG 


AAV1 


AGATGCTGAG 


AAV2 


AGATGCTGCG 


AAV 3 


AGATGCTAAG 


AAV 8 


AGATGCTGAG 


AAV 9 


AGATGCTGAG 


AAV 7 


AGATGCTGAG 


44 2 


AAATGCTGAG 



Fig. 1AAR 



AACGGGCAAT AACTTTGAAT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
AACGGGCAAT AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
GACGGGCAAC AACTTTGAAT 
AACCGGCAAC AACTTTCAGT 
AACCGGCAAC AACTTTCAGT 
AACCGGCAAC AACTTTCAGT 
AACCGGCAAC AACTTTCAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
AACGGGCAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
GACGGGAAAC AACTTCACCT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTACCT 
TACCGGAAAC AACTTTACCT 
GACTGGAAAT AACTTCCAAT 
AACCGGCAAC AACTTCCAGT 
AACCGGCAAC AACTTTCAGT 
AACGGGCAAC AACTTTGAGT 
AACGGGCAAC AACTTTGAGT 



3500 

TCAGCTACAC CTTTGAGGAA 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACAC CTTCGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACCA GTTTGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TTAGCTACAC CTTCGAGGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACAC TTTTGAAGAC 
TCAGCTACCA GTTTGAGGAC 
TCAGCTACAC CTTTGAGGAA 
TCAGCTACAC TTTTGAGGAC 
TCAGCTATAC CTTCGAGGAT 
TTACTTACAC CTTCGAGGAC 
TCAGCTACAC TTTCGAGGAC 
TCAGCTACAG CTTCGAGGAC 
TCAGCTACCA GTTTGAGGAC 
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ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCATAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCACAGC 
ACAGCAGCTA TGCGCATAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCGCACAGC 
ACAGCAGCTA CGCTCACAGC 
ACAGCAGCTA CGCTCACAGC 
ACAGCAGCTA CGCCCACAGC 
ACAGCAGCTA CGCACACAGC 
ACAGCAGCTA CGCACACAGC 
ACAGCAGCTA CGCGCACAGC 
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CAGAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAGAGCCTGG ACAGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG GCCGGCTGAT 
CAGAGTCTGG GCCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAGAGTCTGG ATCGGCTGAT 
CAAAGCCTGG ACCGGCTGAC 
CAGAGCCTGG ACCGGCTGAT 
CAGAGTCTGG ACCGTCTCAT 
CAGAGTTTGG ATCGCTTGAT 
CAGAGCTTGG ACCGGCTGAT 
CAGAGTCTAG ATCGACTGAT 
CAGAGCCTGG ACCGGCTGAT 
CAAAGCCTGG ACCGGCTGAT 
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Fig. 1AAT 



ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTATTA 
ATCGACCAGT ACCTGTATTA 
ATCGACCAGT ACCTGTATTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTGTACTA 
ATTGACCAGT ACCTGTATTA 
ATTGACCAGT ACCTGTATTA 
ATTGACCAGT ACCTGTATTA 
ATTGACCAGT ACCTGTATTA 
ATCGACCAGT ACCTGTACTA 
ATCGACCAAT ACCTGTATTA 
ATCGACCAGT ACCTGTATTA 
ATTGATCAGT ATCTGTACTA 
ATTGACCAGT ACCTGTACTA 
ATCGACCAGT ACCTATACTA 
ATCGACCAGT ACTTGTACTA 
ATCGACCAGT ACCTGTACTA 
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CCTGGCCCGG ACCCAGAGCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CCTGGCCCGG ACCCAGAGCA 
CCTGGCCCGG ACCCAGAGCA 
CCTGGCCCGG ACCCAGAGCA 
CCTGGCCCGG ACCCAGAGCA 
CTTATCCAGA ACTCAGTCCA 
CTTATCCAGA ACTCAGTCCA 
CTTATCCAGA ACTCAGTCCA 
CCTGGTCAGA ACGCAAACGA 
CCTGGTCAGA ACGCAAACGA 
CCTGGTCAGA ACGCAAACGA 
CCTGGTCAGA ACGCAAACGA 
CCTGTCTCGG ACTCAGTCCA 
CCTGTCTCGG ACTCAGTCCA 
CTTGGCCAGA ACACAGAGCA 
CTTGGCCAGA ACACAGAGCA 
CTTGGCCAGA ACACAGAGCA 
CTTGGCCAGA ACACAGAGCA 
CTTGGCCAGA ACACAGAGCA 
CTTGGCCAGA ACACAGAGCA 
CCTGAGCAAA ACTCAGGGTA 
CCTGAGCAAA ACTCAGGGTA 
CCTGAGCAAA ACTCAGGGTA 
CCTGAGCAAA ACTCAGGGTA 
CCTGGCCCGG ACCCAGAGCA 
CCTGAACAGA ACTCAAA . AT 
CTTGAGCAGA ACAAACACTC 
CCTGAACAGA ACGCAAGGAA 
CTTGTCTCGG ACTCAAACAA 
CCTGGTCAGA ACACAGACAA 
CCTGGCCAGA ACACAGAGTA 
CCTGTCTCGG ACTCAGTCCA 
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Fig. 1AAU 



TCCACAAGGG AGCTGCA.GT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
GTCCACAAGG GAGCTGCAGT 
GTCCACAAGG GAGCTGCAGT 
GTCCACAAGG GAGCTGCAGT 
GTCCACAAGG GAGCTGCAGT 
AACTCAAGGT ACTCAGCAAT 
AACTCAAGGT ACTCAGCAAT 
AACTCAAGGT ACTCAGCAAT 
AACTGGAGGG ACGCAGACTC 
AACTGGAGGG ACGCAGACTC 
AACTGGAGGG ACGCAGACTC 
AACTGGAGGG ACGCAGACTC 
TACCGCAGGA ACTCAGCAGT 
TACCGCAGGA ACTCAGCAGT 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
TACTGCTGGC AATCGGGAAC 
AACAACGCAG CAATCGAGAC 
AACAACGCAG CAATCGAGAC 
AACAACGCAG CAATCGAGAC 
AACAACGCAG CAATCGAGAC 
GTCCACAAGG GGGCTGCAGT 
AAGTGCCCAA AACAAGGACT 
AACCACCACG CAGTCAAGGC 
AACAACCAAC CAATCACGGC 
, ACGGCAAAT ACGCAGACTC 
. ACTGGGGGA ACTCAAACTT 
CACAGCTGGC AATCGGGAAC 
TACCGCAGGA ACTCAGCAGT 
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TCCA TCAGGCTGGG 

TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 



TCCA TCAGGCTGGG 

TCCA TCAGGCTGGG 

TCCA TCAGGCTGGG 

TCCA TCAGGCTGGG 



TGTTATTTTC TCAAGCCGGG 
TGTTATTTTC TCAAGCCGGG 
TGTTATTTTC TCAAGCCGGG 
TGGCATTCAG CCAAGCGGGT 
TGGCATTCAG CCAAGCGGGT 
TGGCATTCAG CCAAGCGGGT 
TGGCATTCAG CCAAGCGGGT 
TGCTATTTTC TCAGGCCGGG 
TGCTATTTTC TCAGGCCGGG 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTTTA TCAGGGCGGA 
TGCAGTTCAG CCAAGCTGGG 
TGCAGTTCAA CCAAGCTGGG 
TGCAGTTCAG CCAAGCTGGG 
TGCAGTTCAG CCAAGCTGGG 

TCCA TCAGGCTGGG 

TGCTGTTTAG CCGTGGGTCT 
TTCAGTTTTC TCAGGCCGGA 
TGCTTTTTAG CCAGGCTGGG 
TGGGCTTCAG CCAAGGTGGG 
TGGCATTCAG CCAAGCAGGC 
TGCAGTTTTA CCAGGGCGGG 
TGCTATTTTC TCAGGCCGGG 
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Fig. 1AAV 



TGGCCGAGCA ATCAAAGAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGGCCGAGCA ATCAAAGAAC 
TGGCCGAGCA ATCAAAGAAC 
TGGCCGAGCA ATCAAAGAAC 
TGGCCGAGCA ATCAAAGAAC 
TGTCGGCTCA GGCCAAGAAC 
TGTCGGCTCA GGCCAAGAAC 
TGTCGGCTCA GGCCAAGAAC 
TGGCCAACCA GGCTAGAAAT 
TGGCCAACCA GGCTAGAAAT 
TGGCCAACCA GGCTAGAAAT 
TGGCCAACCA GGCTAGAAAT 
TGTCGGCTCA GGCCAAAAAC 
TGTCGGCTCA GGCCAAAAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCCGAACA AGCAAAGAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCTCAGCA GGCCAAAAAC 
TGGCCGAGCA ATCAAAGAAC 
TGTCTGTTCA GCCCAAAAAC 
TTCGGGACCA GTCTAGGAAC 
TGTCTTTGCA GGCCAGAAAT 
TGGCCAATCA GGCAAAGAAC 
TGGCCAATCA GGCTAGAAAC 
TGGCCGAACA AGCCAAGAAT 
TGTCGGCTCA GGCCAAAAAC 
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TGGCTGCCCG GACCCTGTTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTACCTG GACCGTGTTA 
TGGCTACCTG GACCGTGTTA 
TGGCTACCTG GACCGTGTTA 
TGGGTGCCCG GACCTTGCTA 
TGGGTGCCCG GACCTTGCTA 
TGGGTGCCCG GACCTTGCTA 
TGGGTGCCCG GACCTTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTACCCG GGCCCTGCTA 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTGCCCG GACCTTGCTT 
TGGCTACCGG GACCCAGCTA 
TGGCTACCGG GACCCAGCTA 
TGGCTACCGG GACCCAGCTA 
TGGCTACCGG GACCCAGCTA 
TGGCTGCCCG GACCCTGTTA 
TGGCTACCTG GACCCTGTTA 
TGGCTTCCTG GACCCTGTTA 
TGGCTACCTG GGCCCTGCTA 
TGGCTGCCAG GACCCTGTTA 
TGGGTACCCG GGCCTTGCTA 
TGGTTACCTG GACCTTGCTT 
TGGCTACCCG GGCCCTGCTA 
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Fig. 1AAW 



AGACTGTCAA AAAACATAGA 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACAGTGTC 
CGCGTCTCCA CGACAGTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
AGACTGTCAA AAAACATAGA 
AGACTGTCAA AAAACATAGA 
AGACTGTCAA AAGACATAGA 
AGACTGTCAA AAAACATAGA 
CGAGTTTCCA CGACACTGTC 
CGAGTTTCCA CGACACTGTC 
CGAGTTTCCA CGACACTGTC 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACAACCAA 
CGCGTCTCCA CGACACTGTC 
CGCGTCTCCA CGACACTGTC 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
AGAGTATCCA AGACGCTGGA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
CGAATGTCTA AGACGGCTAA 
AGACTGTCAA AAAACATAGA 
CGCGTTTCTA AAACAAAAAC 
CGAGTATCAA AGACATCTGC 
AGACTTTCAA AGACTGCTAA 
CGCGTCTCAA CGACAACCGG 
CGCGTCTCCA CAACCACCAA 
AGAGTCTCCA AAACGCTGGA 
CGCGTCTCCA CGACACTGTC 
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CAGCAACAAC AACAGTAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
CAGCAACAAC AACAGTAACT 
CAGCAACAAC ACCAGTAACT 
CAGCAACAAC AACAGTAACT 
CAGCAACAAC AACAGTAACT 
GCAAAACAAC AACAGCAATT 
GCAAAACAAC AACAGCAATT 
GCAAAACAAC AACAGCAATT 
CCAGAACAAC AACAGCAACT 
CCAGAGCAAC AACAGCAACT 
CCAGAACAAC AACAGCAACT 
CCAGAACAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TCAAAATAAC AACAGCAACT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
TGACAACAAC AACAGTGAAT 
CAGCAACAAC AACAGTAACT 
AGACAACAAC AACAGCAATT 
GGATAACAAC AACAGTGAAT 
CGACAACAAC AACAGTAACT 
GCAAAACAAC AATAGCAACT 
CCAAAATAAC AACAGCAACT 
TCAAAACAAC AACAGCAACT 
GCAAAATAAC AACAGCAACT 
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Fig. 1AAX 

3800 

AAATACCATC TGAATGGTAG AAATTCATTA 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAGTATCACC TGAATGGCAG AGACTCCCTG 
AAGTATCACC TGAATGGCAG AGACTCCCTG 
AAGTATCACC TGAATGGCAG AGACTCCCTG 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTTTAAGC TGAACGGCCG AGACTCTCTA 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
AAATACCATT TAAATGNAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATACCATT TAAATGGAAG AAATTCATTG 
AAATATTACC TGAATGGAAG AAATTCTCTG 
AAATATTACC CGAATGGAAG AAATTCTCTG 
AAATATTACC TGAATGGAAG AAATTCTCTG 
AAATATTACC TGAATGGAAG AAATTCTCTG 
AAATACCATC TGAATGGTAG AAATTCATTA 
AAATATAACC TCAATGGGCG TGAATCCATC 
AAGTACCACC TCAATGGCAG AGACTCTCTG 
AAATATCATC TCAATGGCCG CGACTCGCTG 
AAATACCATC TGAATGGAAG AAATTCATTG 
AAATTCAAGC TGAACGGGAG AGACTCGCTA 
AAATATCACC TGAACGGCAG AAACTCGTTG 
AAGTATCATC TGAATGGCAG AGACTCTCTG 
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Fig. 1AAY 



GCGTAGCCAT GGCCACCAAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GTGTCGCTAT GGCAACGCAC 
GCGTAGCCAT GGCCACCAAC 
GCGTAGCCAT GGCCACCAAC 
GCGTAGCCAT GGCCACCAAC 
GCGTAGCCAT GGCCACCAAC 
GCGTTGCCAT GGCTACCCAC 
GCGTTGCCAT GGCTACCCAC 
GCGTTGCCAT GGCTACCCAC 
GCGTGGCAAT GGCTTCCCAC 
GCGTGGCAAT GGCTTCCCAC 
GCGTGGCAAT GGCTTCCCAC 
GCGTGGCAAT GGCTTCCCAC 
GTGTCGCTAT GGCAACCCAC 
GTGTCGCTAT GGCAACCCAC 
GTGTCGCCAT GGCAACCCAC 
GTGTCGCCAT GGCAACCCAC 
GTGTCGCCAT GGCAACCCAC 
GTGTCGCCAT GGCAACCCAC 
GTGTCGCCAT GGCAACCCAC 
GTGTCGCCAT GGCAACCCAC 
GGCCCCCAAT GGCCAGTCAC 
GGCCCCCAAT GGCCAGTCAC 
GGCCCCCAAT GGCCAGTCAC 
GGCCCCCAGT GGCCAGTCAC 
GCGTAGCCAT GGCCACCAAC 
GCACTGCTAT GGCCTCACAC 

GGCC AT GGCAAGCCAC 

GACCAGCTAT GGCCAGTCAC 
GCATCGCTAT GGCAACACAC 
GCGTGGCTAT GGCATCGCAC 
GCGTCGCCAT GGCAACTCAC 
GTGTCGCTAT GGCAACCCAC 



3850 

AAGGACGACG AGGACCAGTT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGGCGACG AAGAGCGATT 
AAGGGCGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AGGACCAGTT 
AAGGACGACG AGGACCAGTT 
AAGGACGACG AGGACCAGTT 
AAGGACGACG AGGACCAGTT 
AAGGACGACG AGGAGCGCTT 
AAGGACGACG AGGAGCGCTT 
AAGGACGACG AGGAGCGCTT 
AAGGATGACG ACGACCGCTT 
AAGGATGACG ACGACCGCTT 
AAGGATGACG ACGACCGCTT 
AAGGATGACG ACGACCGCTT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AAGAGCGATT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGACG AGGAACGCTT 
AAGGACGATG AGGAAAAGTA 
AAGGACGATG AGGAAAAGTA 
AAGGACGATG AGGAAAAGTA 
AAGGACGATG AGGAAAAGTA 
AAGGACGACG AGGACCAGTT 
AAAGACGACG AAGACAAGTT 
AAGGACGATG AAGAAAAGTT 
AAGGACGATG AAGAAAAATT 
AAAGACGACG AGGAGCGTTT 
AAAGACGACG AGGACCGCTT 
AAGGACGACG AGGACCGCTT 
AAGGACGACG AAGAGCGATT 
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Fig. 1AAZ 



AACGGAGTGC TGGTTTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AGCGGAGTCT TGATGTTTGG 
AACGGAGTGC TGGTTTTTGG 
AACGGAGTGC TGGTTTTTGG 
AACGGAGTGC TGGTTTTTGG 
AACGGAGTGC TGGTTTTTGG 
AGCGGAGTTC TAATGTTTGG 
AGCGGAGTTC TAATGTTTGG 
AGCGGAGTTC TAATGTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGGGTCC TGATTTTTGG 
AGCGGAGTCT TAATGTTTGG 
AGCGGAGTCT TAATGTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
AGCGGAGTTC TAATTTTTGG 
CACGGAAATC TCATCTTTGG 
CACGGAAATC TCATCTTTGG 
CACGGAAATC TCATCTTTGG 
CACGGAAATC TCATCTTTGG 
AACGGAGTGC TGGTTTTTGG 
AGCGGTGTCA TGATTTTTGG 
AGCGGGGTTC TCATCTTTGG 
CACGGCAATC TAATATTTGG 
AACGGGATCC TGATTTTTGG 
AGTGGCGTTC TCATATTTGG 
AGCGGAGTCC TGATTTTTGG 
AGCGGAGTCT TAATGTTTGG 



3900 

CGAAACGGGG GCTGCCAACA 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA.. 
CAAAACGGGG GCTGCCAACA 
CAAAACGGGG GCTGCCAACA 
CAAAACGGGG GCTGCCAACA 
CAAAACGGGG GCTGCCAACA 
CAAGCAGGGG GCTGGAAA. . 
CAAGCAGGGG GCTGGAAA. . 
CAAGCAGGGG GCTGGAAA.. 
CAAGCAAGGA GCCGGGAA. . 
CAAGCAAGGA GCCGGGAA.. 
CAAGCAAGGA GCCGGGAA.. 
CAAGCAAGGA GCCGGGAA.. 
GAAACAGGGA GCTGGAAA. . 
GAAACAGGGA GCTGGAAA. . 
CAAAACTGGA GCAGCTAATA 
CAAAACTGGA GCAGCTAATA 
CAAAACTGGA GCAGCTAATA 
CAAAACTGGA GCAGCTAATA 
CAAAACTGGA GCAGCTAATA 
CAAAACTGGA GCAGCTAATA 
AAAACAAGGC ACAGGAAC . . 
AAAACAAGGC ACAGGAAC . . 
AAAACAAGGC ACAGGAAC . . 
AAAACAAGGC ACAGGAAC . . 
CAAAACGGGG GCTGCCAACA 
AAAAGAGAGC GCCGGAGC : . 
GAAGCAAGGC TCAGAGAA. . 
CAAAGAAGGG ACAACGGC . . 
CAAACAAAAT GCTGCCAG . . 
CAAGCAAGGA GCCGGGAA. . 
AAAAACTGGA GCAACTAACA 
GAAACAGGGA GCTGGAAA. . 
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Fig. 1AAAA 

3950 

AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AGCGTGATGC TCACCAGCGA AGAAGAAATT 
AGCGTGATGC TCACCAGCGA AGAAGAAATT 
AGCGTGATGC TCACCAGCGA AGAAGAAATT 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
CAAGTGCTGA TTACAGATGA GGAAGAAATC 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
AACGTGCTCA TGACAAATGA AGAAGAAATT 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
TCAGTGCTTA TTACAGACGA AGAAGAAATC 
AACGTGCTAA TGACCAGCGA GGAGGAGATC 
AATGTCATGA TTACAGACGA AGAGGAAATT 
AAGGTCATGA TTACAGACGA AGAGGAAATC 
AATGTAATGA TTACGGATGA AGAAGAGATT 
GATGTCATGC TCACCAGCGA GGAAGAAATC 
CAGGTGCTGA TTACAGATGA GGAAGAAATT 
AATGTGTTAA TGACAAATGA AGAAGAAATT 
AGCGTTATGC TAACCAGTGA GGAAGAAATT 
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Fig. 1AAAB 



ATCCCGTGGC TACAGAAGAA 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCAGTGGC CACAGAACAG 

ATCCCGTGGC TACAGAAGAA 

ATCCCGTGGC TACAGAACAG 

ATCCCGTGGC TACAGAAGAA 

ATCCCGTGGC TACAGAAGAA 

ACCCAGTGGC T AC AG AG C AG 

ACCCAGTGGC TACAGAGCAG 

ACCCAGTGGC TACAGAGCAG 

ACCCCGTGGC CACAGAAGAA 

ACCCCGTGGC CACAGAAGAA 

ACCCCGTGGC CACAGAAGAA 

ACCCCGTGGC CACAGAAGAA 

ACCCAGTGGC CACGGAACAG 

ACCCAGTGGC CACAGAACAG 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ACCCGGTAGC TACCGAGGAA 

ATCCTGTGGC TACAGAACAA 

ATCCTGTGGC TACAGAACAA 

ATCCTGTGGC TACAGAACAA 

ATCCTGTGGC TACAGAACAA 

ATCCCGTGGC TACAGAAGAA 

ACCCTGTGGC CACCGAAAGA 

ATCCCGTGGC TACGGAGCAG 

ATCCTGTGGC AACAGAGCAG 

ACCCTGTGGC TACAGAGGAA 

ACCCTGTAGC CACAGAGGAA 

ATCCTGTAGC CACGGAAGAA 

ACCCAGTGGC CACAGAACAG 



4000 

TACGGTGTGG TCTCCAGCAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TACGGTGTGG TCTCCAGCAA 
TATGGTGTGG TGGCAGACAA 
TATGGTGTGG TGGCAGACAA 
TATGGTGTGG TGGCAGACAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TATGGAGCAG TGGCCATCAA 
TACGGCGTGG TGGCCGATAA 
TACGGCGTGG TGGCCGATAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGGATTG TAAGCAGCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGACAGG TTGCCACCAA 
TACGGTGTGG TCTCCAGCAA 
TTTGGGACCG TGGCAGTCAA 
TATGGTTCTG TATCTACCAA 
TATGGAACTG TGGCAAATAA 
TACGGTATCG TGGCAGATAA 
TACGGAGCAG TGGCCATCAA 
TACGGGATAG TCAGCAGCAA 
TACGGCGTGG TGGCCGATAA 
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Fig. 1AAAC 
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GACCCCAGAC ACAGACTGTC AACAGCCAGG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
CTCCCATTGT GGGAACTGTC AACAGCCAGG 
CTCCCATTGT GGGAACTGTC AACAGCCAGG 
CTCCCATTGT GGGAACTGTC AACAGCCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
AGGCGCAGAC CGGACTCGTG CACAACCAGG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCCCAGAC ACAAGTTGTT AACAACCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
CAGCTTCCTA TGGAAGTGTG GACAGCCAGG 
GACCCCAGAC ACAGACTGTC AACAGCCAGG 
ACCCTGCGAC CGGAGATGTG CATGCTATGG 
AAGCAGCTAC CGCAGATGTC AACACACAAG 
CTCCCACGAC TGGAACTGTC AATCATCAGG 
CTCCTCAAAT TGGAACTGTC AACAGCCAGG 
AGGCGCAAAC TGGACTTGTG CATAACCAGG 
CAGCCCAGAC ACAAGTTGTC AACAACCAGG 
CTCCTATTGT AGGGGCCGTC AACAGTCAAG 
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Fig. 1AAAD 



CGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGCC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
CGGCATGGTC TGGCAGAACC 
TGGTATGGTC TGGCAAAACC 
TGGTATGGTC TGGCAAAACC 
TGGTATGGTC TGGCAAAACC 
CGGCATGGTG TGGCAGAATA 
CGGCATGGTG TGGCAGAATA 
CGGCATGGTG TGGCAGAATA 
CGGCATGGTG TGGCAGAATA 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
TGGAATGGTG TGGCAGGACC 
CGGCATGGTC TGGCAGAACC 
TGGCATGGTG TGGCAAGATA 
AGGCATGGTC TGGCAGGACA 
TGGCATGGTG TGGCAAGATC 
CGGTATGGTC TGGCAGAACC 
TGGTATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 
TGGCATGGTC TGGCAGAACC 



4100 

GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGC 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GGGACGTGTA CCTGCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GCGATGTCTA TCTTCAAGGT 
GGGACGTGTA CCTGCAGGGT 
GAGACGTGTA CCTGCAGGGT 
GAGATGTGTA CCTTCAGGGG 
GTGACGTGTA CCTTCAAGGA 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGC 
GGGACGTGTA CCTGCAGGGT 
GGGACGTGTA CCTGCAGGGT 
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Fig. 1AAAE 



GCCAAAATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCCAAAACTC CTCACACGGA 
GCCAAAACTC CTCACACGGA 
GCCAAAACTC CTCACACGGA 
GCCAAAACTC CTCACACGGA 
GCCAAAATTC CTCACACGGA 
GCCAAAATTC CTCACACAGA 
GCAAAGATTC CACACACGGA 
GCAAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 
GCTAAAATAC CTCACACAGA 
GCCAAGATTC CTCACACGGA 
GCCAAGATTC CTCACACGGA 



4150 

CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CATCCTTCGC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGAAACTTT CATCCCTCGC 
CGGAAACTTT CATCCCTCGC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGCAACTTT CACCCGTCTC 
CGGACACTTT CATCCTTCTC 
CGGACACTTT CATCCTTCTC 
CGGACACTTT CATCCTTCTC 
CGGACACTTT CATCCTTCTC 
CGGCAACTTT CACCCGTCTC 
TGGACACTTT CACCCGTCTC 
CGGACATTTT CACCCCTCTC 
TGGACACTTT CATCCTTCTC 
CGGCAACTTC CACCCGTCTC 
TGGCAACTTT CACCCGTCTC 
TGGCAACTTT CACCCGTCTC 
CGGAAACTTT CATCCCTCGC 
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Fig. 1AAAF 



CGGATTTGGA CTCAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
CGGATTTGGA CTCAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
CGGCTTTGGA CTGAAGCACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
TGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
AGGCTTTGGA CTGAAACACC 
CGGATTTGGA CTCAAACACC 
CGGCTTTGGA CTCAAGAACC 
TGGATTCGGA CTTAAACACC 
AGGCTTTGGA CTGAAACATC 
CGGCTTTGGC CTGAAACATC 
TGGATTTGGA CTGAAACACC 
CGGCTTTGGA CTTAAACATC 
AGGCTTTGGA CTGAAACACC 



4200 

CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA GATCCTGGTG 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTGATT 
CGCCTCCTCA GATCCTGATT 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CGCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CTCCTCCCCA GATCCTGATC 
CGCCTCCTCA AATTCTCATC 
CGCCTCCTCA GATCCTCATC 
CTCCTCCACA GATTCTCATC 
CGCCTCCTCA AATCATGATC 
CTCCGCCTCA GATCCTGATC 
CACCTCCACA GATTCTAATT 
CGCCTCCTCA GATCCTGATC 
CGCCTCCTCA GATCCTGATT 
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CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
CTGTTCCCGC 
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ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
GAGGTGTTTA 
GAGGTGTTTA 
GAGGTGTTTA 
GAGGTGTTTA 
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CTTACCTTCA 
CTTACCTTCA 
CTTACCTTCA 
CTTACCTTCA 
ACTACCTTCA 
ACTACCTTCA 
GAAGTGTTTA 
GAAGTGTTTA 
GAAGTGTTTA 
GAAGTGTTTA 
GAAGTGTTTA 
GAAGTGTTTA 
ACCACTTTCA 
ACCACTTTCA 
ACCACTTTCA 
ACCACTTTCA 
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CTCCTGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
GTCAAGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
GCCAGGCCAA 
GCCAGGCCAA 
GCCAGGCCAA 
ACCAGGCCAA 
ACCAGGCCAA 
ACCAGGCCAA 
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GTCAAGCTAA 
GTCAAGCTAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGCCAA 
CTCCTGGAAA 
CTCCTGGAAA 
CTCCTGGAAA 
CTCCTGGAAA 



CTGTTCCTGC 
CGGTACCTGC 
CGGTACCGGC 
CTGTACCTGC 
CAGTGCCGGC 
CCGTTCCCGC 
CTGTTCCCGC 



GAATCCTCCG 
GAATCCTTCG 
AAATCCTCCG 
GGATCCTCCG 
AGATCCTCCT 
TAATCCTCCG 
GGATCCTCCA 



GCGGAGTTTT 
ACCACCTTCA 
ACGACTTTCA 
ACCACCTTCA 
CTTACCTTCA 
GAGGTGTTTA 
ACTACCTTCA 



CAGCTACAAA 
GTGCGGCAAA 
GCCCGGCCAA 
ACCAGTCAAA 
ATCAAGCCAA 
CTCCTGCCAA 
GTCAAGCTAA 
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4251 

GTTTGCCTCA 

GCTGGCGTCG 
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4300 

TTTATCACGC AGTACAGCAC CGGCCA . GGT CAGCGTGGAG 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTTATCACGC AGTACAGCAC CGGCCA. GGT CAGCGTGGAG 
TTTATCACGC AGTACAGCAC CGGCCA. GGT CAGCGTGGAG 
TTTATCACGC AGTACAGCAC CGGCCA. GGT CAGCGTGGAG 
TTTATCACGC AGTACAGCAC CGGCCA. GGT CAGCGTGGAG 
TTTATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTTATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTTATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGACA. GGT CAGCGTGGAA 
TTCATCACGC AGTACAGCAC CGGGCA . AGT CAGCGTTGAG 
TTCATCACGC AGTACAGCAC CGGGCA. AGT CAGCGTTGAG 
TTCATCACGC AGTACAGCAC CGGGCA. AGT CAGCGTTGAG 
TTCATCACGC AGTACAGCAC CGGGCA. AGT CAGCGTTGAG 
TTCATCACGC AGTACAGCAC CGGGCA. AGT CAGCGTTGAG 
TTCATCACGC AGTACAGCAC CGGGCA. AGT CAGCGTTGAG 
TTCATTACCC AGTATTCCAC CGGACA. GGT CAGCGTGGAA 
TTCATTACCC AGTATTCCAC CGGACA. GGT CAGCGTGGAA 
TTCATTACCC AGTATTCCAC CGGACA. GGT CAGCGTGGAA 
TTCATTACCC AGTATTCCAC CGGACA. GGT CAGCGTGGAA 



GTTTGCTTCA 
GTTTGCTTCC 
GTTTGCTTCA 
GCTGAACTCT 
GCTGAACTCT 
GTTTGCTTCG 
GCTGGCGTCG 



TTCATCACCC 
TTCATCACAC 
TTTATCACTC 
TTCATCACGC 
TTCATCACGC 
TTCATCACAC 
TTCATCACGC 



AATACTCCAC 
AGTACTCCAC 
AGTACTCCAC 
AATACAGCAC 
AGTACAGCAC 
AGTACAGCAC 
AGTACAGCAC 



AGGACA . AGT 
GGGACACGGT 
TGGACA . GGT 
CGGACA . GGT 
GGGACA . AGT 
CGGACA . AGT 
CGGACA. GGT 



GAGTGTGGAA 
CAGCGTGGAG 
CAGCGTGGAA 
CAGCGTGGAA 
CAGCGTGGAA 
CAGCGTGGAA 
CAGCGTGGAA 
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42_2 ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 2__8 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2__15 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_5b ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_lb ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_13 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_3a ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2_4 ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

42_5a ATTGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

4 2_10 ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 2_3b ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

42_11 ATCGAGTGGG AACTGCAGAA AGAGAACAGC AAACGCTGGA ATCCAGAGAT 

42__6b ATCGAGTGGG AACTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

43__1 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT 

43_5 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT 

4 3__12 ATCGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ACCCAGAGAT 

4 3_20 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 3_21 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 3_23 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 3_25 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ATCCAGAGAT 

4 4_1 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAGAT 

4 4_5 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAGAT 

223_10 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223_2 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223^4 ATCGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223__5 ATCGAATGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223__6 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

223J7 ATCGAGTGGG AGCTGCAGAA AGAGAACAGC AAGCGCTGGA ACCCAGAGAT 

A3_4 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

A3_5 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCGGAAAT 

A3J7 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

A3_3 ATAGAGTGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAAAT 

42 12 

AAV1 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ATCCCGAAGT 

AAV 2 ATCGAGTGGG AGCTGCAGAA GGAAAACAGC AAACGCTGGA ATCCCGAAAT 

AAV 3 ATTGAGTGGG AGCTACAGAA AGAAAACAGC AAACGTTGGA ATCCAGAGAT 

AAV 8 ATTGAATGGG AGCTGCAGAA GGAAAACAGC AAGCGCTGGA ACCCCGAGAT 

AAV 9 ATCGAGTGGG AGCTGCAGAA AGAAAACAGC AAGCGCTGGA ATCCAGAGAT 

AAV7 ATCGAGTGGG AGCTGCAGAA GGAAAACAGC AAGCGCTGGA ACCCGGAGAT 

4 4_2 ATTGAATGGG AGCTGCAGAA AGAAAACAGC AAACGCTGGA ACCCAGAGAT 
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Fig. 1AAAJ 



TCAAATTATG CCAAGTCTAA 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCAAATTATG CCAAGTCTAA 
TCAAATTATG CCAAGTCTAA 
TCAAATTATG CCAAGTCTAA 
TCAAATTATG CCAAGTCTAA 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTACT ACAAATCTAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTTTG ACAAACAGAC 
TCCAACTACA ACAAGTCGGT 
TCCAACTACA ACAAGTCGGT 
TCCAACTACA ACAAGTCGGT 
TCCAACTACA ACAAGTCGGT 
TCCAACTACT ACAAATCTAC 
TCCAATTATG CAAAATCTGC 
TCCAACTACA ACAAGTCTGT 
TCCAACTACA ACAAGTCTGT 
TCCAACTACT ACAAATCTAC 
TCAAACTACT ACAAATCTAC 
TCCAACTTTG AAAAGCAGAC 
TCCAACTACT ACAAATCTAC 
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TAAT.GTGGA ATTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT .GTGGA CTTTGCTGTC 
AAAT . GTGGA . CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
TAAT.GTGGA ATTTGCTGTC 
TAAT.GTGGA ATTTGCTGTC 
TAAT.GTGGA ATTTGCTGTC 
TAAT.GTGGA ATTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT . GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
AAAT.GTGGA CTTTGCTGTC 
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AAAT.GTGGA CTTTGCTGTT 
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GAAT. GTGGA GTTTACCGTG 
GAAT. GTGGA GTTTACCGTG 
AAAT.GTGGA CTTTGCTGTC 
CAAC . GTTGA TTTTACTGTG 
TAATCGTGGA CTT.ACCGTG 
TAAT.GTGGA CTTTACTGTA 
AAGT. GTGGA CTTTGCTGTT 
AAAT.GTGGA CTTTGCTGTC 
TGGT. GTGGA CTTTGCCGTT 
AAAT.GTGGA CTTTGCTGTT 



88/105 



WO 03/042397 



PCT/US02/33629 



Fig. 1AAAK 



4401 4450 

42_2 -AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 2_8 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 2^15 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_5b AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_lb AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_13 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_3a AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 2_4 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_5a AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 2JL0 AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_3b AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_U AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

42_6b AACAACGAAG GGGTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

43_1 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CTCGTTATCT 

43_5 AATACCGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CTCGTTATCT 

4 3 J. 2 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CTCGTTATCT 

4 3_20 AACACGGAAG GAGTTTATAG CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

43_21 AACACGGAAG GAGTTTATAG CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

43_23 AACACGGAAG GAGTTTATAG CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

43_25 AACACGGAGG GGGTTTATAG CGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 4_1 AACACAGATG GCACTTATTC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

4 4_5 AACACAGATG GCACTTATTC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

223^10 GACAGCCAGG GTGTTTACTC TGAGCCT 

223_2 GACAGCCAGG GTGTTTACTC TGAGCCT [ ] [ 

223_4 GACAGCCAGG GTGTTTACTC TGAGCCT *, 

223_5 GACAGCCAGG GTGTTTACTC TGAGCCT 

223_6 GACAGCCAGG GTGTTTACTC TGAGCCT ' 

223_7 GACAGCCAGG GTGTTTACTC TGAGCCT 

A3_4 GACGCAAACG GTGTTTATTC TGAACCCCGC CCTATTGGCA CTCGTTACCT 

A3__5 GACGCAAACG GTGTTTATTC TGAACCCCGC CCTATTGGCA CTCGTTACCT 

A3_7 GACGCAAACG GTGTTTATTC TGAACCCCGC CCTATTGGCA CTCGTTACCT 

A3_3 GACGCAAACG GTGTTTATTC TGAACCCCGC CCTATTGGCA CTCGTTACCT 

42_12 AATACTGAGG GTACTTATTC AGAGCCTCGC CCCATTGGCA CCCGTTACCT 

AAV1 GACAACAATG GACTTTATAC TGAGCCTCGC CCCATTGGCA CCCGTTACCT 

AAV 2 GATACTAATG GCGTGTATTC AGAGCCTCGC CCCATTGGCA CCAGATACCT 

AAV 3 GACACTAATG GTGTTTATAG TGAACCTCGC CCTATTGGAA CCCGGTATCT 

AAV 8 AATACAGAAG GCGTGTACTC TGAACCCCGC CCCATTGGCA CCCGTTACCT 

AAV 9 AATACCGAAG GTGTTTACTC TGAGCCTCGC CCCATTGGTA CTCGTTACCT 

AAV 7 GACAGCCAGG GTGTTTACTC TGAGCCTCGC CCTATTGGCA CTCGTTACCT 

44_2 AACACAGATG GCACTTATTC TGAGCCTCGC CCCATCGGCA CCCGTTACCT 
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Fig. 1AAAL 

4451 4500 
VP1-3 stop Poly A s ignal 

42_2 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

4 2_8 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGC TAATTCGTTT 

4 2 m 15 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42_5b CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

4 2_lb CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

42_13 CACCCGTAGC CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 

42_3a CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

4 2_4 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

4 2_5a CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42_10 CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42_3b CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

42_11 CACCCGTAAC CTGTAATTAC TTGTTAATCA ATAAACCGGT TGATTCGTTT 

4 2_6b CACCCGTAAC CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 

4 3_1 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT 

4 3_5 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TAATTCGTTT 

4 3_12 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TAATTCGTTT 

43_20 CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

4 3_21 CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

4 3_23 CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

4 3_25 CACCCGCAAC CTGTAATTAC ATGTTAATCA ATAAACCGGT TAATTCGTTT 

4 4JL CACCCGTAAT CTGTAATTGC TCGTTAATCA ATAAACCGGT TGATTCGTTT 

4 4_5 CACCCGTAAT CTGTAATTGC TTGTTAATCA ATAAACCGGT TGATTCGTTT 

223_10 

223_2 

223__4 

223_5 

223^6 

223J7 

A3_4 TACCCGGAAC 

A3_5 TACCCGGAAC 

A3J7 TACCCGGAAC 

A3_3 TACCCGGAAC 

42_12 CACCCGTAAC 

AAV1 TACCCGTCCC 

AAV2 GACTCGTAAT 

AAV 3 CACACGAAAC 

AAV8 CACCCGTAAT 

AAV 9 CACCCGTAAT 

AAV 7 CACCCGTAAT 

44 2 CACCCGTAAT 



TTGTAATTTC CTGTTAATGA ATAAACCGAT TTATGCGTTT 
TTGTAATTTC CTGTTAATGA ATAAACCGAT TTATGCGTTT 
TTGTAATTTC CTGTTAATGA ATAAACCGAT TTATGCGTTT 
TTGTAATTTC CTGTTAATGA ATAAGCCGAT TTATGCGTTT 
CTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 
CTGTAATTAC GTGTTAATCA ATAAACCGGT TGATTCGTTT 
CTGTAATTGC TTGTTAATCA ATAAACCGTT TAATTCGTTT 
TTGTGAATCC TGGTTAATCA ATAAACCGTT TAATTCGTTT 
CTGTAATTGC CTGTTAATCA ATAAACCGGT TGATTCGTTT 
TTGTAATTGC CTGTTAATCA ATAAACCGGT TAATTCGTTT 
CTGTAATTGC ATGTTAATCA ATAAACCGGT TGATTCGTTT 
CTG TAA TTGC TTGTTAAT CA ATAAA CCGGT TGATTCGTTT 
vpl-3 stop PolyA signal 
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Fig. 1AAAM 
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CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTCGTTTA AACCTGCAGG 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC. T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC. T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 



CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTCGC . GG CCGCTA. 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC. T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTCGT . TT AAACCT 

CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 

CAGTTGAACT TTGGTCTCCT GTCCTTCTTA TCTTATCGGT TACCATGGTT 
CAGTTGAACT TTGGTCTC. T GCGTATTTCT ..TTCTT.AT CTAGTTTCCA 
CAGTTGAACT TTGGCTCT . T GTGCACTTCT TTATCTTTAT CTTGTTTCCA 

CAGTTGAACT TTGGTCTC . T GCG 

CAGTTGAACT TTGGTCTC . T GCG 

CAGTTGAACT TTGGTCTCCT GTGCTTCTTA TCTTATCGGT TTCCATAGCA 
CAGTTGAACT TTGGTCTC . T GCGAAGGGCG AATTC 
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Fig. 1AAAN 
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Fig. 1AAAO 
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. CTTGGC GTAATCATGG GTCATAG. 



TCGCGA TAAAAGACTT ACGTCATCGG GTTACCCCTA GTGATGGAGT 
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TACAACTGCT GGTTAATATT TAACTCTCGC CATACCTCTA GTGATGGAGT 



AAV" 7 : GTCACCGC GGTACCCCTA GTGATGGAGT 
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Fig. 1AAAP 
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Fig. 1AAAQ 
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Fig. 1AAAR 
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Fig. 2A 
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Fig. 2B 
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Fig. 2C 



. ooooSSoSooSSoSooooooaoooooooooooioiooooooiooooooiooaooooaooc" 



« oooaooiaoooDOoaoooooaoooocioooaoooiv,- . ^ 




, „ „ ^ooooooooooo 

PRPRPPPPPQC — 

MHHMMHMMMHHnrir-<ni-i»-« p ' 

mi 




JHHQQQQQQQQQqnqqnnnuoM 
» oSoooaooooooooooioiciooiaaooaaoooooiooioaociaaaaoOiaaoaooaoooaao • 

.$232232222332922339939232229222^ 

.kkfcbEoQOQQqQOOfinoQpnqoopQQqpqpprjattqpoqOQpQ • 
onhwwwwnwG2wm03www ■ 

"SSSoSaoooaaaooooooooaoooacPOOoooaooaooooooooaooo'OioooaoaoKv 

• n m inuui w w w w « w tn u to w w ra iq h ino w w ini w u w w iut w w w « ii ui it it w w tn tuui w uj w w w w W w ui • 

:&fc&&^&S&Bep&&&ppEpp&&pppp^^ 

HHHH«y«yyyi'y!r'y.^i', - >>>>>>>>>^vJqq^p>>>>?C>^>>f>>>>H - 

oooaoaoooooooociaacioaoaaoooo;'. 

to w w u 

.... „ „ ... _ w » rf w ^ ^ w « .... " w w w © © © D 

unnbUDDQUDOUUbbboOUOODbubbbOOOUOOOUOOOUOOUOUQDUUv, 

. oooooooSoaSooSooooaaooaooaaoaaaoooooaaoooioowooooooaoacioor) 

« CU ^ ^ ^ CV» CU Cl4 C3i« CW ^ ^ CW Cm £»« CV« PLt ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ 1 

♦ Sooou©ooouob©oo©©o©©©»o©©©© • 
NWWNoaooaaoaaooooooaaoooaooooooooooooaoo • 

.gggS33 2 2 52 3593 93 393 92 99539993399 9 222 5 5 3 8 329 99 229 2 2 239 32 S : 

Jlilggggggllgggggggggggggglgggglgggggggggggllggggggggpr 

MWwwtowioMMMinwwtowwojwaiWwwoiwiowinMWtaineaww 
^aoooaaooooobSooooooooooaoaoaooooooooSaooooooooooa^ 

••>>>>>>>>>>>>HHto>>>>>MMHMHWMHWMHHHMHHW>>>>>MMHHMMHH>}>&HMJ> 



• z 2 2 2 in til n to u v) ci) tn t ------ - .- - .. — — ... ... ..... .••> .... — .... ... ... . 

hPPP>>>>>>>>>: 

• is £ a 5oooooooooiDoaooooioooooaaoo( - 

• cyQOOnnnMUMuiinuuviiiitAtnunuviiouiovitatfltfliAt/iuitniQtAi 

i i i » i i i i i i i i i i i » I t I 

H [C HI H I I * * I I M I I I I I M t I I I I I I I M I I I M I 

.obbo»oo©©QDbo©obo©bb 

&&i:t:giSSIgSgSir — — - - 

X >* ^ JH JM Jm i 



WMHH?>HHHHHb?? 

•'jtiUDobuobooobbl 

sggaggggggggggj 



3 13 a ^ t 



• ooooctoaooooooob&o 



jbSbnuoooobbubouoooouoooDubBI 

?8ggigg»igggs§ggggggeeigKiiiej 

^PhPopoppppoophoociippohHHFH'H 



ggggggggggg^gggggggggggggggggggg^ggg^g 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaasaaaaa 




• MMMMMHMMMHH MM HHH H H M H M M MMMMMMr?i-««r«»-«nn t m v-f , »i -l l"l n r- i-i i-i M i-i p 1 , i n % i 

« OOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOPOOOOOOPOOOOOOO 



_ J Js i £ i L - 

^MMMMMMMMMMMMI 



OOOUOOOUOOOtSOOH 



SHE": 



> I I I |> > M > ! |M I I | | | [H M H IHrt tr> M Vrt M .2HMMHnMMr»MW»Ma 1) 



99/105 



WO 03/042397 



PCT/US02/33629 



Fig. 2D 
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Fig, 2E 
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Fig. 2F 
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C2, SEQ ID NO:61 ; C5, SEQ ID NO:62; A3-3, SEQ ID NO:66; A3-7, SEQ ID NO:67; 
A3-4, SEQ ID NO:68; A3-5, SEQ ID NO: 69; 3.3b, SEQ ID NO: 62; 223.4, SEQ ID NO: 
73; 223-5, SEQ ID NO:74; 223-10, SEQ ID NO:75; 223-2, SEQ ID NO:76; 223-7, SEQ ID 
NO: 77; 223-6, SEQ ID NO: 78; 44-1 , SEQ ID NO: 79; 44-5, SEQ ID NO:80; 44-2, SEQ ID 
NO:81 ; 42-15, SEQ ID NO: 84; 42-8, SEQ ID NO: 85; 42-13, SEQ ID NO:86; 42-3A, SEQ 
ID NO:87; 42-4, SEQ ID NO:88; 42-5A, SEQ ID NO:89; 42-1B, SEQ ID NO:90; 42-5B, 
SEQ ID NO:91 ; 43-1, SEQ ID NO: 92; 43-12, SEQ ID NO: 93; 43-5, SEQ ID NO:94; 43- 
21, SEQ ID NO:96; 43-25, SEQ ID NO: 97; 43-20, SEQ ID NO:99; 24.1, SEQ ID NO: 101 ; 
42.2, SEQ ID NO:102; 7.2, SEQ ID NO: 103; 27.3, SEQ ID NO: 104; 16.3, SEQ ID NO: 
105; 42.10, SEQ ID NO: 106; 42-3B, SEQ ID NO: 107; 42-11, SEQ ID NO: 108; Fl, SEQ 
ID NO: 1 09; F5, SEQ ID NO: 1 10; F3, SEQ ED NO:l 1 1 ; 42-6B, SEQ ID NO: 11 2; and 42- 
12, SEQ ID NO: 113. 

30. A molecule comprising a nucleic acid sequence encoding a novel adeno- 
associated virus (AAV) serotype capsid protein, said nucleic acid sequence selected from the 
group consisting of: AAV5, SEQ ID NO:2; 42-2 , SEQ ID NO:9; 42-8 ,SEQ ID NO:27; 42- 
15 , SEQ ID NO:28; 42-5b ,SEQ ID NO: 29; 42-lb ,SEQ ID NO:30; 42-13, SEQ ID NO: 
31 ; 42-3a , SEQ ID NO: 32; 42-4 ,SEQ ID NO:33; 42-5a , SEQ ID NO: 34; 42-10, SEQ ID 
NO:35; 42-3b, SEQ ID NO: 36; 42-1 1, SEQ ID NO: 37; 42-6b, SEQ ID NO:38; 43-1 , SEQ 
ID NO: 39; 43-5, SEQ ID NO: 40; 43-12 ,SEQ ID NO:41 ; 43-20, SEQ ID NO:42; 43-21, 
SEQ ID NO: 43; 43-23, SEQ ID NO:44; 43-25 ,SEQ ID NO: 45; 44.1, SEQ ID NO:47; 
44.5, SEQ ID NO:47; 223.10, SEQ ID NO:48; 223.2 , SEQ ID NO:49; 223.4, SEQ ID 
NO:50; 223.5, SEQ ID NO: 51; 223.6, SEQ ID NO: 52; 223.7, SEQ ID NO: 53; A3.4, SEQ 
ID NO: 54; A3.5, SEQ ID NO:55; A3.7 , SEQ ID NO: 56; A3.3 ,SEQ ID NO:57; 42.12, 
SEQ ID NO: 58; 44.2, SEQ ID NO: 59; AAV10 , SEQ ID NO: 1 17; AAV11, SEQ ID NO: 
118; AAV 12 ,SEQ ID NO:l 19; A3.1, SEQ ID NO:120; and H6, SEQ ID NO: 25. 

31. A molecule comprising a nucleic acid sequence encoding a fragment of an 
adeno-associated virus capsid protein, said nucleic acid sequence selected from the group 
consisting of: 

vpl.nt 825 to 3049; 
vp2, nt 1234 to 3049; 
vp 3, nt 1434 to 3049; 
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nt 468 to 3090; and 
nt 725 to 3090, 

wherein the nucleotides numbers are of AAV7, SEQ ID NO:l and 
correspond to sequences in 42-2 , SEQ ID NO:9; 42-8 ,SEQ ID NO:27; 42-15 , SEQ ID 
NO:28; 42-5b ,SEQ ID NO: 29; 42- lb ,SEQ rD NO:30; 42-13, SEQ ID NO: 31; 42-3a , 
SEQ ID NO: 32; 42-4 ,SEQ ID NO:33; 42-5a , SEQ ID NO: 34; 42-10, SEQ ID NO:35; 42- 
3b, SEQ ID NO: 36; 42-1 1, SEQ ID NO: 37; 42-6b, SEQ ID NO:38; 43-1, SEQ ID NO: 39; 
43-5, SEQ ID NO: 40; 43-12 ,SEQ ID NO:41 ; 43-20, SEQ ID NO:42; 43-21, SEQ ID NO: 
43; 43-23, SEQ IDNO:44; 43-25 ,SEQ ID NO: 45; 44.1, SEQ IDNO:47; 44.5 5 SEQ ID 
NO:47; 223.10, SEQ ID NO:48; 223.2 , SEQ ID NO:49; 223.4, SEQ ID NO:50; 223.5, SEQ 
ID NO: 51; 223.6, SEQ ID NO: 52; 223.7, SEQ ID NO: 53; A3.4, SEQ ID NO: 54; A3.5, 
SEQ ID NO:55; A3.7 , SEQ ID NO: 56; A3.3 ,SEQ ID NO:57; 42.12, SEQ ID NO: 58; 
44.2, SEQ ID NO: 59; AAV10 , SEQ ID NO: 1 17; AAV11, SEQ ID NO: 118; AAV12 
,SEQ ID NO: 1 19; A3. 1, SEQ ID NO: 120; and H6, SEQ ID NO: 25. 

32. The molecule according to claim 26 or claim 28 to 3 1, wherein said 
molecule is a plasmid. 

33. The molecule according to claim 26 or claim 28 to 3 1 , wherein said 
molecule further comprises a functional AAV rep gene. 

34. The molecule according to claim 30, wherein said nucleic acid sequence is 
the AAV7 sequence, SEQ ID NO:L 

35. A method of generating a recombinant adeno-associated virus (AAV) 
comprising an AAV serotype capsid comprising the steps of culturing a host cell containing 
(a) a molecule according to any of claims 26 or claim 28 to 31 which encodes an adeno- 
associated virus capsid; (b) a functional rep gene; (c) a minigene comprising AAV inverted 
terminal repeats (ITRs) and a transgene; and (d) sufficient helper functions to permit 
packaging of the minigene into the AAV capsid protein. 

36. A host cell transfected with an adeno-associated virus according to claim 24 
a molecule according to any of claims 26, or claims 28 to 3 1 . 
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37. A composition comprising an AAV according to claim 24 or claim 27, and a 
physiologically compatible carrier. 

38. A composition comprising a molecule according to any of claims 26, or claims 
28 to 31 and a physiologically compatible carrier. 

39. A method of delivering a transgene to a cell, said method comprising the step of 
contacting the cell with an AAV according to claim 24 or claim 27, wherein said rAAV 
comprises the transgene. 

40. A molecule comprising a heterologous adeno-associated virus (AAV) 
serotype 7 nucleic acid sequence, said sequence comprising: 

nucleotides (nt) 1 to 107 of SEQ ID NO: 1; 
nt 107 to 2215 of SEQ ID NO:l; 
nt 334 to 221 5 of SEQ ID NO: 1 ; 
nt 2222 to 4435 of SEQ ID NO: 1 . 
nt 2633 to 4435 of SEQ ID NO:] ; 
nt2831 to 4435 of SEQ ID NO: 1; and 
nt 4704 to 472 1 of SEQ ID NO: 1 . 

41 . A molecule encoding an adeno-associated virus (AAV) serotype 7 rep 
protein or a fragment thereof, said protein or fragment selected from the group consisting of. 

amino acid (aa) 1 to 623, aa 1 to 171 ; aa 172 to 372, aa 373 to 444, and aa 
445 to 623 of SEQ ID NO:3. 

42. A host cell containing a molecule according to claim 40 or 41 . 
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TTCTCGGCGG 


CAGCAAGGTG 


CGCGTGGACC 





CGTGGAGTCC 
GCCGTTGCAG 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 



GCCAAGGCCA 
GACCGGATGT 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 



TTCTCGGCGG 
TCAAATTTGA 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 



CAGCAAGGTG 
ACTCACCCGC 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAAGTG 
CAGCAAAGTG 



CGCGTGGACC 
CGTCTGGAGC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGCGTGGACC 
CGCGTGGACC 



CGTGGAATCT 
CGTGGAATCT 
CGTGGAATCT 
CGTGGAATCT 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCG 
CGTGGAGAGC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 
CGTGGAGTCC 



GCCAAAGCCA 
GCCAAAGCCA 
GCCAAAGCCA 
GCCAAAGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAAGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 
GCCAAGGCCA 



TTCTGGGTGG 
TTCTGGGTGG 
TTCTGGGTGG 
TTCTGGGTGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGAGG 
TTCTGGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 
TTCTCGGCGG 



AAGCAAGGTT 
AAGCAAGGTT 
AAGCAAGGTT 
AGGCAAGGTT 
CAGCAAGGTG 
CAGCAAGGTG 
AAGCAAGGTG 
AAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAGGTG 
CAGCAAAGTG 



CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGTGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
CGCGTGGACC 
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AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 


GTCTTCCGCC 
GTCTTCCGCC 
GTCGTCCGCC 
GTCGTCCGCC 


CAGATCGATC 
CAGATCGATC 
CAGATCGACC 
CAGATCGACC 


CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 


1600 
GATCGTCACT 
GATCGTCACT 
GATCGTCACC 
GATCGTCACC 


AAAAGTGCAA 
AAAAGTGCAA 


GTCGTCCGCC 
GTCGTCCGCC 


CAGATCGATC 
CAGATCGATC 


CCACCCCCGT 
CCACCCCCGT 


GATCGTCACT 
GATCGTCACT 


AAAAGTGCAA 


GTCGTCCGCC 


CAGATCGACC 


CCACCCCCGT 


GATCGTCACC 





AAAAGTGCAA 
ATGACTTTGG 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 
AAAAGTGCAA 



GTCTTCCGCC 
CAAGGTGACA 
GTCGTCCGCC 
GTCGTCCGCC 
GTCGTCCGCC 
GTCTTCCGCC 
GTCTTCCGCC 
GTCTTCCGCC 
GTCTTCCGCC 
GCCGTCCGCC 
GTCGTCCGCC 



CAGATCGATC 
AAGCAGGAAG 
CAGATCGACC 
CAGATCGACC 
CAGATCGACC 
CAGATCGATC 
CAGATCGATC 
CAGATCGATC 
CAGATCGATC 
CAGATCGACC 
CAGATCGACC 



CCACCCCCGT 
TCAAAGAGTT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT 
CCACCCCCGT- 



GATCGTCACT 
CTTCCGCTGG 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 
GATCGTCACC 



AGAAATGCAA GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

AGAAATGCAA GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

AGAAATGCAG GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

AGAAATGCAA GTCTTCGGCC CAGATCGACC CGACTCCGGT GATTGTCACC 

AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

AGAAATGCAA GTCCTCGGCC C AG AT AG AC C CGACTCCCGT GATCGTCACC 

AAAAGTGCAA GTCATCGGCC CAGATCGAAC CCACTCCCGT GATCGTCACC 

AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACTCCCGT GATCGTCACC 

AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 

AAAAGTGCAA GTCGTCCGCC CAGATCGACC CCACCCCCGT GATCGTCACC 
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Fig. 1AG 
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4 22 TCCAACACCA ACATGTGCGC TGTGATTGAC GGGAACAGCA CCACCTTCGA 

« 8 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA cScOTCGA 

42_ 5 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA ScScGA 

42-lb TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACC^CGA 

42_13 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42_3a TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 

42_5a TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42~3b ......[[ 

l^ CACCR ACATGTGC GC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

42_6b GCGCAGGATC ACGTGACCGA GGTGGCGCAT GAGTTCTACG TCAGAAArrr 

43_1 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGaIcSca CCACCTTCGA 

A ll lr C ^ CACCR ^GATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 

4 3 12 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

43 20 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCG CCACC'ScGA 

43 2 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACC^CGA 

43_23 TCCAACACCA ACATGTGCGC CGTGATTGAC' GGGAACAGCA CCACC^CGA 

43_25 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACC1TCGA 

4 4_1 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACCTTCGA 

223 10 TCCAACACCA ^TGTGCGC CGTGATTGAC GGGAACAGCA CCAcScGA 

223 2 ' ' ." ' [[[[] 

223_4 

223_5 

223 6 .' ' 

223 7 

A3_4 TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACCTTCCA 

A3_5 TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACcScGA 

A _ TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACCTTCGA 

A3_3 TCTAACACCA ACATGTGCGC CGTGATTGAC GGAAACTCGA CCACCTTCGA 

42_12 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 

AAV1 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 

AAV2 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACTCAA CGACC^CGA 

AAV3 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 

AAV8 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 

AAV9 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScS 

AAV7 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 

4 4_2 TCCAACACCA ACATGTGCGC CGTGATTGAC GGGAACAGCA CCACcScGA 
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GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 


CCGTTACAAG 
CCGTTACAAG 
CCGTTGCAGG 
CCGTTACAAG 


ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 


CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 


1700 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 


GCACCAGCAG 
GCACCAGCAG 


CCGTTACAAG 
CCGTTACAAG 


ACCGGATGTT 
ACCGGATGTT 


CAAATTTGAA 
CAAATTTGAA 


CTCACCCGCC 
CTCACCCGCC 


GCACCAGCAG 


CCGTTGCAGG 


ACCGGATGTT 


CAAATTTGAA 


CTCACCCGCC 





GCACCAGCAG CCGTTACAAG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

TGGAGCCAAC AAGAGACCCG CCCCCGATGA CGCGGATAAA AGCGAGCCCA 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTCGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTCGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTCGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAATTTGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCGGG ACCGGATGTT CAAGTTTGAA CTCACCCGCC 

GCACCAGCAG CCGTTGCAGG ACCGGATGTT CAAGTTTGAA CTCACCCGCC 



GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
ACACCAGCAG 
GCATCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 
GCACCAGCAG 



CCGTTGCAAG 
CCGTTGCAAG 
CCGTTGCAAG 
CCGTTGCAAG 
CCGTTACAAG 
CCGTTGCAGG 
CCGTTGCAAG 
CCGCTGCAGG 
CCTCTCCAGG 
CCTCTCCAGG 
CCGTTGCAGG 
CCGTTGCAGG 



ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 
ACCGGATGTT 



CAAATTTGAA 
CAAATTTGAA 
•CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
CAAATTTGAA 
TGAATTTGAA 
TAAGTTCGAA 
TAAGTTCGAA 
CAAATTTGAA 
CAAGTTTGAA 



CTTACCCGCC 
CTTACCCGCC 
CTTACCCGCC 
CTTACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTTACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
CTCACCCGCC 
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Fig. 1AI 
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42_2 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

42_8 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

4 2_15 GTCTGGAGCA TGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

4 2_5b GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

4 2_lb 

42_13 GTCTGGAGCA TGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

4 2_3a GTCTGGAGCA TGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

42__5a GTCTGGAGCA TGACTTTGGC AAGGCGACAA AGCAGGAAGT CAAAGAGTTC 

4 2~3b .......... . . . 

42_11 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

42_6b AGCGGGCCTG CCCCTCAGTC GCGGATCCAT CGACGTCAGA CGCGGAAGGA 

4 3_1 GTCTGGAGCA CGACTTTGGC AAGGTGACCA AGCAGGAAGT CAAAGAGTTC 

4 3_5 GTCTGGAGCA CGACTTTGGC AAGGTGACCA AGCAGGAAGT CAAAGAGTTC 

4 3_12 GTCTGGAGCA CGACTTTGGC AAGGTGACCA AGCAGGAAGT CAAAGAGTTC 

43_20 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

43_21 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

43^23 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

43_25 GTCTGGAGCA TGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGGGTTC 

44_1 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAGAGAGTTC 

223 \o GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAGAGAGTTC 

223 2 [ 

223_4 [] [ 

223_5 '] 

223 6 [] 

223 7 .......... 

A3_4 GTTTGGATCA TGACTTTGGG AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

A3_5 GTTTGGATCA TGACTTTGGG AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

A3_7 GTTTGGATCA TGACTTTGGG AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

A3_3 GTTTGGATCA TGACTTTGGG AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

42_12 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV1 GTCTGGAGCA TGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV2 GTCTGGATCA TGACTTTGGG AAGGTCACCA AGCAGGAAGT CAAAGACTTT 

AAV3 GTTTGGACCA TGACTTTGGG AAGGTCACCA AACAGGAAGT AAAGGACTTT 

AAV8 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV9 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAAAGAGTTC 

AAV7 GTCTGGAGCA CGACTTTGGC AAGGTGACGA AGCAGGAAGT CAAAGAGTTC 

4 4_2 GTCTGGAGCA CGACTTTGGC AAGGTGACAA AGCAGGAAGT CAGAGAGTTC 
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Fig. 1AJ 
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42_2 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_8 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_1S TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_5b TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4p3 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_3a TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 2 5a TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42 10 

42~3b .......... .......] 

42_11 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

42_6b GCTCCGGTGG ACTTTGCCGA CAGGTACCAA AACAAATGTT CTCGTCACGC 

43_1 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

43 5 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

43_12 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

43_20 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 

43 21 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 
43_23 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 
43_25 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCCACGT 

44_1 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCACG AGTTCTACGT 

223 4 10 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCACG AGTTCTACGT 

223_2 

223_4 

223_5 ..." 

223_6 ] / ' ' ' ' 

223_7 

A3_4 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

A3_5 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

A3_7 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

A3_3 TTCCGGTGGG CTCAAGATCA CGTGACTGAG GTGGAGCATG AGTTCTACGT 

4 2_12 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

AAV1 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

AAV2 TTCCGGTGGG CAAAGGATCA CGTGGTTGAG GTGGAGCATG AATTCTACGT 

AAV3 TTCCGGTGGG CTTCCGATCA CGTGACTGAC GTGGCTCATG AGTTCTACGT 

AAV8 TTCCGCTGGG CCAGTGATCA CGTGACCGAG GTGGCGCATG AGTTTTACGT 

AAV 9 TTCCGCTGGG CCAGTGATCA CGTGACCGAG GTGGCGCATG AGTTTTACGT 

AAV7 TTCCGCTGGG CCAGTGATCA CGTGACCGAG GTGGCGCATG AGTTCTACGT 

4 4_2 TTCCGCTGGG CGCAGGATCA CGTGACCGAG GTGGCGCACG AGTTCTACGT 
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P4 0/TATA 



CAGAAAGGGT 
CAGAAAGGGT 
CAGAAAGGGT 


bbAbLLAACA 
GGAGCCAACA 
GGAGCCAACA 
GGAGCCAACA 


AGAGACCCGC 
AGAGACCCGC 
AGAGACCCGC 
AGAGACCCGC 


CCCCGATGAC 
CCCCGATGAC 
CCCCGATGAC 
CCCCGATGAC 


GCGGATAAAA 
GCGGATAAAA 
GCGGATAAAA 
GCGGATAAAA 


CAGAAAGGGT 
CAGAAAGGGT 


GGAGCCAACA 
GGAGCCAACA 


AGAGACCCGC 
AGAGACCCGC 


CCCCGATGAC 
CCCCGATGAC 


GCGGATAAAA 
GCGGATAAAA 


CAGAAAGGGT 


GGAGCCAACA 


AGAGACCCGC 


CCCCGATGAC 


GCGGATAAAA 





CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 
GGGCATAGCG CTGACGTAAA TCACGTCATA GGGGAGTGGT CCTGTATTAG 
CAGAAAGGGC GGAGCCAGCA AAAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGC GGAGCCAGCA AAAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGC GGAGCCAGCA AAAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGCGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATATAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 
CAGAAAGGGT GGAGCCAACA AGAGACCCGC CCCCGATGAC GCGGATAAAA 



CAAAAAGGGT 
CAAAAAGGGT 
CAAAAAGGGT 
CAAAAAGGGT 
CAGAAAGGGT 
CAGAAAGGGT 
CAAAAAGGGT 
CAGAAAGGGT 
CAGAAAGGGC 
CAGAAAGGGC 
CAGAAAGGGC 
CAGAAAGGGT 



GGAGCCAAGA 
GGAGCCAAGA 
GGAGCCAAGA 
GGAGCCAAGA 
GGAGCCAACA 
GGAGCCAACA 
GGAGCCAAGA 
GGAGCTAAGA 
GGAGCCAGCA 
GGAGCCAGCA 
GGAGCCAGCA 
GGAGCCAACA 



AAAGGCCCGC 
AAAGGCCCGC 
AAAGGCCCGC 
AAAGGCCCGC 
AGAGACCCGC 
AAAGACCCGC 
AAAGACCCGC 
AACGCCCCGC 
AAAGACCCGC 
AAAGACCCGC 
AAAGACCCGC 
AGAGACCCGC 



CCCCGATGAT 
CCCCGATGAT 
CCCCGATGAT 
CCCCGATGAT 
CCCCGATGAC 
CCCCGATGAC 
CCCCAGTGAC 
CTCCAATGAC 
CCCCGATGAC 
CCCCGATGAC 
CCCCGATGAC 
CCCCGATGAC 



GTATATATAA 
GTATATATAA 
GTATATATAA 
GTATATATAA 
GCGGATAAAA 
GCGGATAAAA 
GCAGATATAA 
GCGGATGTAA 
GCGGATAAAA 
GCGGATAAAA 

gcggaItataa 

GCGGA r 



'AAAA 
P4 0/TATA 
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